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Intelligent  Automated  Numerical  Control  Programming 
Systems  Research  Task  3.1  Final  Report 


EXECUTIVE  SUMMARY 

Effective  storage,  processing  and  exchange  of  product  data  is  essential  to  improving 
manufacturing  operations.  National  efforts  to  standardize  the  manner  in  which  a  product 
is  described  include  efforts  by  the  Computer  Aided  Logistics  Support  (CALS),  the 
International  Graphics  Exchange  Standard  (IGES),  and  the  Product  Definition  Exchange 
Specification/Standard  for  Exchange  of  Product  Data  (PDES/STEP).  These  efforts 
guided  the  manner  in  which  the  project  progressed.  Prof.  Leon  Cox  participated  in  the 
’94  Air  Force  CALS  Testbed  (AFCTB)  project  utilizing  IGES  V5.1  translator  to 
download  product  data  for  use  with  the  developing  Turning  Assistant  to  develop  the  CNC 
program  for  part  number  7307165  pin  and  shipped  the  finished  pin  to  the  CALS 
conference  for  the  assembly  demonstration  with  no  noted  problems.  Dr.  David  Culler 
attended  the  CALS  conference.  Prof.  Cox  attended  Autofact  conference  and  discussed 
PDES/STEP  efforts  with  Gary  Patelski  of  General  Motors  (GM),  who  is  heading  efforts 
there. 

The  dissertation  work  by  Dr.  David  Culler  completes  the  objectives  to  develop  an 
automated  turning  package  capable  of  handling  a  variety  of  rotational  features  that  can  be 
extended  to  other  machining  and  process  planning  activities.  The  work  by  Prof.  Leon  Cox 
and  Dr.  Amjed  Al-Ghanim  in  artificial  neural  networks  (ANN’s)  addresses  the  objective  to 
develop  a  formalized  knowledge  base  of  “industrial  strength”  turning  strategies.  As  the 
several  papers  point  out  are  useful  tools  for  collecting,  storing  and  utilizing  ANN’s  in  an 
automated  CNC  programming  environment.  Further,  the  turning  assistant  was  able  to 
make  calls  to  the  ANN’s  and  obtain  parameters  useful  in  the  process  planning  of  a  product 
manufacture.  This  work  can  never  completely  reach  completion  to  provide  a  complete 
knowledge  base  because  there  is  a  limitless  supply  of  data  from  machinists;  however,  the 
format  by  which  the  ANN’s  have  been  developed  provide  the  means  for  individual 
companies  to  develop  a  rich  subset.  And  finally,  work  by  Kathy  Lueders,  MSIE 
candidate,  to  develop  links  to  commercial  knowledge  bases  is  progressing  and  an 
unpublished  paper  describing  her  work  is  included. 

PAPERS  INCLUDED 


Feb.  94,  SYSCAP  Proceedings:  Cox,  Al-Ghanim,  "A  Neural  Network  Approach  for 
Computer-Aided  Process  Planning  of  CNC  Milling  Parameters  and  Strategies 

May  94,  IERC  Proceedings:  Cox,  Al-Ghanim,  "Neural  Networks  Applied  to  Computer- 
Aided  Process  Planning". 


May  94,  Ph.D.  Dissertation,  Culler,  David  E.,  “The  Turning  Assistant:  Automated 
Planning  for  Numerical  Control  Lathe  Operations”. 


1  -  2 


Feb.  95,  Proceeding  of  the  1st  World  Congress  on  Intelligent  Manufacturing  Processes 
and  Systems:  Cox,  Al-Ghanim,  Culler,  “Knowledge  Acquisition  Techniques  for  Intelligent 
Machining  Assistants”. 

Mar.  95,  Proceedings  of  the  XVII  Conference  on  Computers  and  Industrial  Engineering: 
Cox,  Al-Ghanim,  Culler,  “A  Neural  Network-Based  Methodology  for  Machining 
Knowledge  Acquisition”.  Also  appears  in  Pergamon  Press  Computers  industrial 
engineering.  Vol.  29  pp  217-220,  1995. 

Nov.  95,  Intelligent  Engineering  Systems  Through  Artificial  Neural  Networks.  Cox,  Al- 
Ghanim,  Culler,  “Machining  Strategies  Collected,  Stored  and  Utilized  with  Artificial  Neural 
Networks”,  ISBN  0-7918-0048-2,  ASME  Press,  1995. 

Dec.  95,  Cox,  Al-Ghanim,  “A  Process  Planning  Technique  Utilizing  Neural  Networks  to 
Determine  CNC  Milling,  Drilling  and  Turning  Parameters  and  Strategies”,  (Abstract,  Not 
Published). 

Dec.  95,  Lueders,  Kathryn,  “Utilization  of  Knowledge  Base,  Process  Planning  Tool  and 
Neural  Net  Inputs  to  a  Machining  System”,  (Course  paper  and  basis  for  Thesis). 
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TASK  3.1 


PUBLICATIONS  AND  TECHNICAL  REPORTS  PUBLISHED 


A  Neural  Network  Approach  for  Computer-Aided 
Process  Planning  of  CNC  Milling  Parameters  and  Strategies 

Amjed  Al-Ghanim,  PhD  Candidate  and  Leon  Cox,  Assoc.  Prof. 

New  Mexico  State  University 
Department  of  Industrial  Engineering 

ABSTRACT 

Artificial  neural  systems  are  becoming  an  integral  part  of  the  computer  integrated 
manufacturing  environments,  especially  in  a  market  place  where  improvements  must  take  place  in 
the  form  of  increased  productivity.  The  function  of  process  planning  is  a  prominent  one  that  has 
direct  impact  on  overall  manufacturing  productivity.  This  paper  is  an  initial  investigation  into  the 
potential  of  applying  artificial  neural  paradigm  to  the  selection  of  machining  parameters  of  a  milling 
process.  The  viability  of  this  approach  stems  from  the  ability  of  neural  nets  to  handle  ill-structured 
problems,  and  their  capability  of  generalization.  Our  simulation  results,  applied  to  tool  material 
selection,  show  a  high  potential  for  the  development  of  neural  network  modules  for  similar  process 
planning  functions. 

1.0  INTRODUCTION 

Process  Planning  (PP)  can  be  defined  as  the  subsystem  responsible  for  converting  design 
data  into  work  instructions.  It  represents  the  link  between  engineering  design  and  shop  floor 
manufacturing,  and  determines  the  manufacturing  operations  required  to  transform  a  part  from  a 
rough  state  to  a  finished  state  specified  by  the  engineering  drawing.  More  specifically,  it  is  a 
function  which  maps  the  design  features  to  manufacturing  features  [1].  As  related  to  machining, 
process  planning  consists  of  a  series  of  tasks  interpreting  the  product  model  including  selection  of 
machine  tools,  tool  sets,  setups  (e.g.  design  of  fixtures),  and  machining  sequences  (e.g.  generation 
of  NC  programs)  [2],  As  such,  process  planning  is  an  involved  activity  that  has  a  large  number  of 
input  variables  describing  the  part  to  be  produced  and  the  production  resources. 

As  established,  there  are  two  systematic  computer-aided  process  planning  strategies;  namely 
the  variant  planning  and  the  generative  planning.  The  distinguish  ing  feature  of  the  variant 
planning  strategy  is  that  former  plans  are  retrieved  and  modified  for  new  parts,  while  generative 
planning  is  a  strategy  that  strives  to  create  a  new  plan  for  a  part  from  scratch  based  on  analyzing 
part  geometry  and  other  related  specifications  [1]. 

Artificial  Intelligence  (AI)  technology,  Group  Technology  (GT),  [3],  and  Expert  Systems 
(ES)  technology  [4,5,6]  have  been  employed  to  assist  process  planning.  Rule-based  expert  systems 
based  on  experience  extracted  from  machinists  have  been  built  to  generate  process  plans.  Various 
methodologies  have  been  employed  to  implement  generative  CAPP  expert  systems  including  among 
others,  case-based  reasoning  [2],  feature-based  recognition  [7,8],  and  solid  modeling  techniques 
[9,10]. 

If  a  product  model  existed  that  contained  all  processing  parameters  necessary  for  the 
manufacture  of  the  product  then  a  subset  of  these  would  contain  information  regarding  any 
machining  that  would  need  to  take  place  during  a  particular  interval  of  development.  Virtually  all 
products  require  a  chip  producing  machine  tool  in  some  stage  of  manufacture.  Today  much  of  the 
chip  producing  is  being  accomplished  through  computer  numerical  control  (CNC). 
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In  most  cases  during  the  planning  for  the  CNC  machining  operations,  it  is  necessary  to 
consult  an  expert.  Whether  that  expert  is  a  machinist,  part  programmer  or  a  knowledge  base,  the 
specific  set-up  and  run  parameters  for  the  machining  process  must  be  explicitly  determined. 

We  are  focusing  on  the  ability  of  the  human  (and  in  the  long  run  the  ability  of  a  pattern 
recognition  system)  to  identify  a  feature  that  needs  to  be  machined  such  as  a  pocket,  notch,  slot, 
etc.  Once  identified  that  feature  will  have  parameters  and  strategies  developed  specific  to  its 
nature.  Thereby,  taking  the  larger  problem  and  simplifying  it. 

Most  research  efforts  as  described  have  focused  on  utilizing  expert  systems  technology  to 
implement  process  planning  systems.  These  systems  offer  assistance  based  on  capabilities  tanging 
from  fast  retrieval  of  existing  plans  to  generating  plans  based  on  interpreting  product  model 
geometry.  Actually,  expert  systems  have  shown  success  in  automating  certain  aspects  of  process 
planning.  However,  with  the  increasing  level  of  complexity  of  manufacturing  environments,  and 
the  increasing  demand  for  achieving  levels  of  integration,  expert  systems  methodology  suffers  some 
limitations.  First,  when  domain  knowledge  is  complex  and  intuitive  as  in  the  case  of  process 
planning,  the  knowledge  acquisition  phase  of  system  development  becomes  a  real  bottleneck 
because  it  is  not  always  possible  to  extract  and  represent  knowledge  in  an  explicit  form  [2,6,11]. 
Second,  when  the  system  rule  base  gets  larger,  the  knowledge  validation  process  becomes  difficult 
and  prone  to  errors  [12].  Finally,  a  dynamic  manufacturing  environment  requires  utilization  of  fast 
adaptive  systems.  However,  expert  systems,  due  to  explicit  knowledge  representation,  show 
limited  adaptive  capabilities,  and  they  do  not  tolerate  missing  or  inaccurate  data. 

To  compensate  for  some  of  these  limitations  and  enhance  the  expert  systems  approach,  this 
paper  describes  a  methodology  of  process  planning  for  CNC  milling  based  on  the  concept  of 
artificial  neural  networks.  First,  process  planning  is  cast  as  a  pattern  recognition  problem,  and 
viewed  as  a  mapping  function  between  design  features  and  machining  features.  Second,  a  neural 
network  model  is  developed  for  automating  certain  aspects  of  process  planning  decisions  such  as 
cutting  tool  material,  tool  entry  method,  and  cutting  strategies.  Simulation  results  are  shown  for 
appropriate  network  parameters. 

2.0  ARTIFICIAL  NEURAL  NETWORKS 

There  has  been  an  increase  of  interest  in  'brain  style  computing'  in  terms  of  artificial  neural 
networks  (ANNs).  ANNs  are  parallel  distributed  processing  architectures  composed  of  a  huge 
number  of  small  interacting  elements  that  are  massively  interconnected.  Each  of  these  elements 
sends  excitatory  and  inhibitory  signals  to  other  units  that  in  turn  update  their  behaviors  based  on 
these  received  messages.  ANNs  emulate  the  functionality  of  the  human  brain  that  implicitly  stores 
knowledge  in  the  interconnection  weights,  and  not  in  the  neurons  themselves  [13].  Learning  is  thus 
achieved  by  modifying  the  connection  weights  between  elements.  Feedforward  neural  nets  acquire 
knowledge  through  training,  where  by  repeatedly  presenting  sample  cases  to  the  net,  its 
interconnection  weights  change  accordingly  to  model  the  representation  of  the  cases  [14]. 
Knowledge  is  stored  in  the  final  interconnections  values.  As  such  neural  network  methodology 
would  ease  the  knowledge  acquisition  bottleneck  that  is  hampering  the  creation  of  expert  system 
[15].  The  key  to  a  neural  network's  power  is  the  inherent  parallelism  resulting  in  fast  computation, 
robustness  or  error  resistance,  and  adaptiveness  or  generalization.  A  typical  network  consists  of  a 
set  of  processing  elements  (PE)  grouped  hierarchically  in  layers  and  interconnected  in  some 
fashion.  These  PEs  sum  N  weighted  inputs  and  transfer  that  outcome  through  a  mathematical 
function.  Figure  1  shows  a  processing  element  and  a  three  layer  neural  network.  One  of  the  most 
popular  neural  paradigms,  and  the  one  that  has  been  used  in  this  project,  is  the  multilayer 


1  -  6 


perceptron  trained  using  the  Error  Back  Propagation  Training  Algorithm  (EBPTA)  [16].  Within 
the  manufacturing  environment  ANNs  have  been  applied  to  quality  control,  process  control,  part- 
family  classification  [17,18,19,20,21]  Neural  network  models  have  been  also  used  for  optimizing 
cutting  parameters  of  turning  operations  [22]. 


Figure  1.  Generic  Neural  Network  Components 
2.1  NEURAL  NETWORK  DESIGN  CONSIDERATIONS 

One  of  the  advantages  of  the  backpropagation  algorithm  is  its  applicability  to  a  large 
number  of  network  design  architectures;  that  is,  the  choices  for  the  number  of  layers, 
interconnections,  learning  constants,  and  data  representation  [16].  The  problem  of  selecting 
network  parameters  is  under  intensive  research  and  there  are  no  conclusive  answers  available  [15]. 
Analysis  of  this  problem  takes  the  form  of  experimenting  with  different  design  choices  and 
performing  simulations,  the  results  of  which  give  guidelines  for  final  parameter  selection.  The 
design  factors  considered  here  are:  learning  rate,  and  number  of  neurons  in  a  hidden  layer.  The 
number  of  layers  in  a  network  can  only  be  determined  through  simulation.  However,  the  literature 
suggests  a  maximum  of  three  to  four  layers  (i.e.  one  to  two  hidden  layers);  as  more  than  three 
layers  will  improve  performance  very  little  at  the  expense  of  high  convergence  time  [24].  The 
number  of  input  neurons  is  simply  identical  to  the  dimension  size  of  the  input  vector.  In  cases 
where  the  number  of  input  neurons  is  less  than  input  vector  size,  then  the  number  of  training 
samples  has  to  be  increased.  The  number  of  output  neurons  is  usually  selected  to  be  the  number  of 
classes.  In  such  a  scheme,  called  local  representation,  the  network  performs  as  a  decoder.  The 
number  of  neurons  in  the  hidden  layer,  7,  can  be  determined  by  a  simple  (empirical)  formula 
relating  the  number  of  separable  decision  regions,  M,  the  size  of  the  input  vector  n.  [15]. 
However,  in  cases  where  the  number  of  decision  regions  is  not  known  (as  in  this  case),  the  number 
of  hidden  neurons  can  initially  be  taken  to  be  equal  half  the  input  vector  dimension,  n.  To  provide 
a  satisfactory  level  of  class  separation  each  decision  class  should  have  enough  number  of  examples 
(i.e.  5-10). 

The  effectiveness  and  convergence  of  the  backpropagation  algorithm  depends  significantly 
on  the  appropriate  choice  of  the  learning  constant  (r|).  In  general,  the  optimum  value  of  (ri) 
depends  on  the  problem  being  solved,  and  so  can  be  determined  experimentally.  A  true  learning 
strategy  calls  for  small  values  of  (rj),  which  will  consequently  increase  convergence  time.  Larger 
values  of  (r|)  may  cause  the  algorithm  to  overshoot  and  oscillate,  thus  preventing  convergence.  The 
momentum  factor  can  be  introduced  to  alleviate  this  problem,  by  allowing  smaller  values  of  (t|) 
and  a  faster  convergence  process. 

3.0  PROCESS  PLANNING  :  A  PATTERN  RECOGNITION  PROBLEM 

Pattern  Recognition  (PR)  can  be  characterized  as  an  information  mapping  process  taking 
place  over  a  set  of  (metric/non-metric)  spaces  [14].  An  abstract  view  of  the  PR  description 
problem  is  shown  below  in  Figure  2.  A  relationship  Gt  stands  as  a  mapping  between  class- 
membership  space,  C,  and  a  pattern  space,  P.  Each  Class  vv;  corresponds  to  a  subset  of  patterns  in 
the  pattern  space,  where  the  /*  pattern  is  denoted  as  p{.  These  pattern  spaces  may  overlap  allowing 
patterns  from  different  classes  to  share  the  same  attributes.  Figure  2  shows  also  another  mapping 
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from  the  pattern  space  to  an  observation  space,  with  feature  patterns  mh  via  the  relationship  M.  All 
patterns  pertaining  to  the  different  spaces  are  represented  in  an  abstract  form  by  vectors,  and  any 
given  space  may  assume  a  specific  dimension,  n,  and  thus  be  represented  generically  as  U". 

A  relationship  can  formally  be  defined  as  M:  Rn—>Rm]  that  is  a  mapping  from  a  space  of 
dimension  n  to  a  space  of  dimension  m.  Using  this  concept,  the  characterization  of  many  PR 
problems  is  simply  that,  given  a  observation  vector  mh  a  method  is  desired  to  identify  and  invert 
mapping  M  and  Gt  for  all  i.  As  seen  from  Figure  2,  different  pattern  vectors  may  be  associated 
with  the  same  observation  vector,  and  single  class  membership  can  point  to  more  that  one  pattern. 
This  characteristic  suggests  a  decision  problem  with  ambiguity  or  vagueness.  Pattern  recognition 
techniques;  namely  statistical,  structural,  and  artificial  neural  are  employed  to  overcome  vagueness. 
Depending  on  the  nature  of  the  problem  (the  pattern  generating  environment)  and  on  how  well  it 
can  be  structured,  a  suitable  PR  technique  can  be  applied.  For  example,  statistical  PR  can  be  used 
when  patterns  come  from  a  statistically  characterized  environment.  However,  quantitative  precise 
descriptions  of  some  complex  decision  problems  may  be  largely  unknown  and  a  detailed 
characterization  may  be  hopeless.  A  viable  alternative  is  to  treat  the  problem  from  an  input/output 
or  'black  box’  viewpoint  [14].  A  Neural  pattern  recognizer  is  such  a  good  example  of  black  box 
systems.  Because  process  planning  (PP)  is  such  an  ambiguous  and  ill-structured  problem  (at  least 
in  certain  aspects),  it  seems  reasonable  to  approach  PP  using  neural  network  PR  techniques. 


Figure  2.  PR:  Mapping  in  an  Abstract  Form 

Process  planning  can  be  cast  as  a  pattern  recognition  problem;  that  is  set  of  mapping  from 
one  space(s)  to  another  space(s).  First,  as  explained  earlier,  process  planning  for  CNC  machining 
is  a  multivariable  input/output  process,  including  among  others  part  dimensions,  material,  surface 
finish,  fixture  rigidity,  tool  material,  spindle  speed,  feed  rate,  depth  of  cut,  etc..  Therefore,  to 
facilitate  analysis,  one  should  precisely  identify  the  output  decision  quantities  and  corresponding 
input  decision  quantities.  In  this  case,  as  related  to  milling,  the  output  decision  variable  is  cutting 
tool  material  type.  Other  factors  being  considered  are  the  strategy  for  entry  of  the  tool  into  the 
material  and  the  cutting  strategy  once  the  material  is  entered. 

The  input  decision  variables  pertaining  to  the  part  to  be  manufactured  are:  feature  types, 
perimeter,  length,  width,  depth,  comer  radius,  material,  tolerance,  and  finish  as  shown  in  figure  3. 
Other  input  decision  variables  are  machinist  preference  and  delivery  requirement  of  the  part.  Each 
of  the  output  decision  variables  depends  on  a  mixture  of  the  input  variables,  and  possibly  on  some 
other  input  variables.  The  mixture  of  the  input  variables  forms  the  input  space,  or  in  pattern 
recognition  terminology  the  observation  space.  The  output  variables  forms  the  output  space,  or 
likewise  the  decision  space.  The  idea  of  the  'black  box'  is  to  collapse  the  effects  of  individual  input 
variables  into  a  combined  input  effect,  with  no  major  emphasis  on  any  particular  input  variable. 
The  concept  permits  capturing  inexact  relationships  between  individual  inputs  and  outputs.  Figure 
4  is  a  schematic  view  of  a  'black  box'  of  a  sub-process  planner.  Here,  the  output  decision  variable 
is  the  cutting  tool  material. 
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Figure  3.  A  Pocket  Feature  Figure  4.  A  'Black  Box'  View  of  the 

Tool  Material  Type  Selector 

4.0  A  NEURAL  NETWORK  PROCESS  PLANNER 

The  research  methodology  adopted  aimed  at  exploring  the  utilization  of  neural  network 
models  as  supportive  decision-making  tools  in  machining  process  planning.  To  carry  out  this 
objective,  process  parameters  have  been  identified,  as  related  to  milling  processes.  Table  1  shows 
the  input  and  output  parameters  for  three  network  models. 

Table  1.  Neural  Network  Machining  Strategy  Models 


Network  1 .  Output  decision  variable:  Tool  material  type 
Values  (classes)  of  output 

High  speed  steel 

Carbide 

Ceramic 

Input  variables:  Feature  type 

Depth  of  cut 
Part  material  hardness 
Machinability 
Fixture  rigidity 

M/C  max  rpm,  feed,  horsepower 

Network  2.  Output  decision  variable:  Entry  strategy 
Values  (classes)  of  output: 

Free  entry 
Drill  hole 
Ramp 
Plunge 
Helix 

Input  variables: 

Feature  type,  #  of  Islands,  smallest  gap 
Feature  Width,  Depth,  Length,  Perimeter 
Part  material  hardness 
Fixture  rigidity 

M/C  max  rpm,  feed,  horsepower 
Machinability 
Surrounding  geometry 
Geometry  near  entry. 

Network  3 .  Output  decision  variable:  Cutting  strategy 
Values  (classes)  of  output: 

Collapse- in 
Collapse-out 
Constant  angle 
Finish 
Rough 

Finish  and  rough 

Input  variables: 

Feature  type,  #  of  Islands,  smallest  gap 
Feature  Length,  Width,  Depth,  Perimeter 
Comer  rad(convex) 

Comer  rad(concave) 

Part  matrl  hardness 
Side  tolerance 
Surface  finish  rms 
Fixture  rigidity 

M/C  max  rpm,  feed,  horsepower 
Machinability 
Surrounding  geometry 
Geometry  near  entry 
Base  tolerance 
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It  should  be  noted  that  the  output  decision  variables  were  chosen  such  that  their  values  can 
be  grouped  into  a  limited  number  of  classes.  Actually,  this  the  basic  idea  of  utilizing  neural  nets  as 
pattern  recognizers.  During  the  learning  phase,  the  network  learns  all  possible  classes  and 
corresponding  regions,  and  consequently,  during  the  testing  phase,  the  network  recognizes  a  new 
input  case  (vector)  by  assigning  it  to  the  closest  decision  class.  A  new  case  represented  to  the 
network  as  a  vector  may  be  corrupted  by  noise  ( e.g.  machinist  preference);  however,  the  network 
will  recommend  the  closest  possible  solution  in  terms  of  the  output  variable. 

The  development  process  of  the  network  consists  of  two  main  steps.  First,  a  preliminary 
data  set  was  obtained,  and  an  initial  network  architecture  was  selected  for  training  to  make  sure  that 
a  neural  model  of  the  problem  can  be  generated.  Second,  a  sensitivity  analysis  procedure  was 
conducted  to  determine  optimal  network  design  parameters.  The  data  collection  phase  was  carried 
out  in  Integrated  Manufacturing  Systems  Laboratory  at  New  Mexico  State  University  from 
interfiews  with  machinists.  Forty  examples  were  constructed  for  each  decision  output  variable  (i.e. 
about  10-15  examples  for  every  output  class)  which  provided  a  good  level  of  discrimination 
between  classes. 

As  shown  in  Table  1,  input  variables  range  in  type  and  form.  Some  variables  can 
intrinsically  have  numeric  values  (e.g.  depth,  hardness),  while  others  (e.g.  machinability  and 
fixture  rigidity)  can  best  be  described  in  linguistic  form,  such  as  high,  medium  or  low.  Since 
neural  nets  only  operate  on  numeric  data,  such  variables  have  been  quantified  so  that  they  have 
numeric  values  between  0  and  1.  The  values  of  variables  can  range  from  .0001's  (for  feed)  to 
lOk’s  (for  machine  ipm).  During  training  of  a  network,  the  wide  range  can  introduce  a  large 
amount  of  bias  in  the  values  of  connection  weights.  To  alleviate  this  problem,  all  input  variables 
have  normalized  so  that  their  values  fall  in  the  range  0  and  1.  This  range  is  actually  a  preferable 
choice  for  network  operation;  as  the  activation  function  (unipolar  sigmoid)  assumes  the  same  range, 
thus  resulting  in  moderate  values  of  the  weights.  The  cuttoff  value  for  classification  is  (.80). 

5.0  NETWORK  TRAINING  AND  RESULTS 

For  this  paper  only  the  results  of  the  tool  material  selection  strategy  are  presented  since  it  is 
a  primary  decision.  Training  was  carried  out  using  a  two-layer  network  with  a  momentum  factor 
of  (.9),  a  learning  factor  of  (.  1),  and  target  error  of  (5%).  Several  runs  were  conducted  for  various 
numbers  of  hidden  neurons  as  shown  in  Table  2,  for  the  tool  material  network.  Guided  by  these 
results  the  number  of  hidden  neurons  was  selected  to  be  16. 

Table  2  #  of  neuron  vs.  convergence  cycles  (T|  = .  1) 


network  #  #  of  hidden  neurons  #  of  converg.  cycles 


1. 

5 

1442 

2. 

8 

2161 

3. 

11 

1461 

4. 

13 

1156 

5. 

16 

817 

Then  using  this  number  of  hidden  neurons,  the  network  was  retrained  with  different  values  of 


the  learning  constant,  as  shown  in  Table  3. 

Table  3  Learning  constant  vs.  convergence  cycles  (  hidden  neurons =15) 


learning  constant^)  convergence  cycles 


0.10 

0.20 

0.30 
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817 

631 

501 


0.35 

0.40 


593 

1034 


One  of  the  important  performance  measures  of  network  operation  is  the  percentage  of  classification 
errors.  The  networks  designed  were  tested  first,  using  the  training  examples,  and  second,  using  a 
new  set  of  20  known  cases.  The  performance  of  the  network  exceeded  the  expectations;  as  the 
network  was  unable  to  recognize  at  most  two  of  the  forty  training  examples,  for  different  learning 
values.  When  used  in  the  testing  mode,  the  network  performance  ranges  from  a  low  69%  for  a 
learning  constant  of  .30  to  a  high  92%  for  a  learning  constant  of  0.10  correct  classifications  of  the 
exposed  cases.  These  results  are  shown  in  Table  4.  These  results  are  obtained  for  a  target  error 
value  of  (5%). 


(Training  examples) 


(Testing  examples) 


6.0  CONCLUSION 


Table  4  Network  classification  performance 

hidden  nodes 

learning  constant^) 

correct  classification  (%) 

5 

.10 

95% 

11 

.10 

95% 

16 

.10 

100% 

5 

.20 

90% 

11 

.20 

95% 

16 

.20 

97% 

5 

.30 

95% 

11 

.30 

95% 

16 

.30 

95% 

5 

.10 

88% 

11 

.10 

90% 

16 

.10 

92% 

5 

.20 

80% 

11 

.20 

85% 

16 

.20 

88% 

5 

.30 

80% 

11 

.30 

76% 

16 

.30 

69% 

We  have  shown  the  viability  of  utilizing  the  Neural  Networks  approach  for  assisting  the 
process  planner  or  CNC  programmer  in  obtaining  machining  set-ups  and  strategies.  We  were  able 
to  handle  the  uncertainty  and  vagueness  that  is  inherent  in  the  minds  of  the  "experts".  It  also  shows 
strong  promise  as  a  method  of  storing  knowledge  and  being  retrievable  in  a  rapid  manner.  As  new 
examples  become  proven  methodology  for  an  industry,  the  Neural  Net  is  easily  retrained  to  handle 
updated  information.  The  optimal  design  of  a  Neural  Network  architecture  is  achieved  through 
running  simulations  and  in  our  case  the  "best"  was  sixteen  hidden  nodes  and  0. 1  learning  factor. 
Future  work  is  planned  in  fuzzifying  the  linguistic  type  inputs  and  verifying  the  strategic  outputs  on 
CNC  machine  tools. 
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ABSTRACT  to  implement  generative  CAPP  expert  systems  including 

among  others,  case-based  reasoning  [2],  feature-based 
The  function  of  process  planning  is  a  prominent  one  recognition  [7,8],  and  solid  modeling  techniques  [9,10], 
that  has  direct  impact  on  overall  manufacturing  Expert  systems  have  been  shown  to  have  specific 
productivity.  This  paper  presents  results  from  limitations  in  these  areas.  [11,12] 
preliminary  investigations  applying  artificial  neural 

networks  ability  to  select  machining  parameters  of  a  Virtually  all  products  require  a  chip  producing  machine 
milling  process  and  as  a  storage  medium  for  the  tool  in  some  stage  of  manufacture.  Today  much  of  the  chip 
knowledge.  The  viability  of  this  approach  stems  from  producing  is  being  accomplished  through  computer 
the  ability  of  neural  nets  to  handle  ill-structured  numerical  control  (CNC).  In  most  cases  during  the 
problems,  and  their  capability  of  generalization,  planning  and  programming  stages  for  the  CNC  machining 
Simulation  results,  applied  to  tool  material  selection  and  operations,  it  is  necessary  to  consult  an  expert.  If  a  product 
tool  entry  strategy,  show  a  high  potential  for  the  model  that  contains  all  processing  parameters  necessary  for 
development  of  neural  network  modules  for  similar  the  manufacture  of  the  product  exists,  then  a  subset  of  these 
process  planning  functions.  would  contain  information  regarding  any  machining  that 

would  need  to  take  place  during  a  particular  interval  of 
1;0  INTRODUCTION  development.  The  specific  set-up  and  run  parameters  for 

the  machining  process  must  be  explicitly  determined. 
Process  planning  is  a  function  which  maps  the  design  Process  planning  for  CNC  machining  is  a  multivariable 
features  to  manufacturing  features  [1].  It  represents  the  input/output  process  considering,  among  others,  part 

link  between  engineering  design  and  shop  floor  dimensions,  material,  surface  finish,  fixture  rigidity,  tool 

manufacturing,  and  determines  ..the  manufacturing  material,  spindle  speed,  feed  rate,  depth  of  cut,  etc..  As 
operations  required  to  transform  a  part  from  a  rough  state  related  to  milling,  the  output  decision  variables  can  be,  for 
to  a  finished  state  specified  by  the  engineering  drawing,  instance,  cutting  tool  material  type,  the  strategy  for  entry 
As  related  to  machining,  process  planning  consists  of  a  of  the  tool  into  part  material,  and  the  cutting  strategy  once 
series  of  tasks  to  interpret  the  product  model  including  the  part  material  is  entered.  For  this  investigation  the  first 
selection  of  machine  tools,  tool  sets,  setups  (e.g.  design  of  two  decision  variables  have  been  investigated.  We  are 
fixtures),  and  machining  sequences  (e.g.'  generation  of  NC  focusing  on  two  aspects,  first,  the  ability  of  the  human  (and 
programs)  [2].  As  such,  process  planning  is  an  involved  in  the  long  run  the  ability  of  a  pattern  recognition  system) 
activity  that  has  a  large  number  of  input  variables  to  identify  manufacturing  features  that  need  to  be  machined 

describing  the  part  to  be  produced  and  the  production  such  as  a  pocket,  notch,  slot,  etc.  and  second,  the  ability  to 

resources.  .  store  machining  process  planning  information  in  a 

convenient  manner  for  retreival.  Once  a  feature  is 
There  are  two  systematic  computer-aided  process  planning  identified  it  will  have  parameters  and  strategies  developed 
strategies;  namely  variant  planning  and  generative  specific  to  its  nature. 
planning.  The  distinguishing  feature  of  the  variant 

planning  strategy  is  that  former  plans  are  retrieved  and  2.0  ARTIFICIAL  NEURAL  NETWORKS 
modified  for  new  parts,  while  generative  planning  is  a 

strategy  that  strives  to  create  a  new  plan  for  a  part,  from  There  has  been  an  increase  in  interest  in  *brain  style 
scratch,  based  on  analyzing  part  geometry  and  other  related  computing’  in  terms  of  artificial  neural  networks  (ANNs). 
specifications  [1].  Artificial  Intelligence  (AI)  technology,  ANNs  are  parallel  distributed  processing  architectures 
Group  Technology  (GT),  [3],  and  Expert  Systems  (ES)  composed  of  a  large  number  of  small  interacting  elements 
technology  [4,5,6]  have  been  employed  to  assist  process  that  are  massively  interconnected.  Each  of  these  elements 
planning.  Rule-based  expert  systems  based  on  experience  sends  excitatory  and  inhibitory  signals  to  other  units  that  in 
extracted  from  machinists  have  been  built  to  generate  turn  update  their  behaviors  based  on  these  received 
process  plans.  Various  methodologies  have  been  employed  messages.  ANNs  emulate  the  functionality  of  the  human 
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brain  that  implicitly  stores  knowledge  in  the 
interconnection  weights,  and  not  in  the  neurons  themselves 
£13J.  The  learning  process  (i.e.  knowledge  acquisition) 
♦alrftg  place  by  presenting  sample  cases  to  the  network  and 
modifying  the  connection  weights  between  elements 
(through  a  learning  algorithm)  to  model  the  representation 
of  the  cases  [14].  Knowledge  is  stored  in  the  final 
interconnections  values.  Because  training  examples  need 
not  be  exhaustive,  neural  network  methodology  would  ease 
the  knowledge  acquisition  bottleneck  that  is  hampering  the 
creation  of  expert  system  [15].  The  key  to  a  neural 
network's' power  is  the  inherent  parallelism  resulting  in  fast 
computation,  robustness  or  error  resistance,  and 
adaptiveness  or  generalization.  A  typical  network  consists 
of  a  set  of  processing  elements  (PE)  grouped  hierarchically 
in  layers  and  interconnected  in  some  fashion.  These  PEs 
sum  N  weighted  inputs  and  transfer  that  outcome  through  a 
mathematical  function.  Figure  1  shows  a  processing 
element  and  a  three  layer 


neural  network.  One  of  the  most  popular  neural 
paradigms,  and  the  one  that  has  been  used  in  this  paper,  is 
the  multilayer  perception  trained  using  the  Error  Back 
Propagation  Training  Algorithm  (EBPTA)  [16].  Within 
the  manufacturing  environment  ANNs  have  been  applied 
to  qualify  control,  process  control,  part-family  classification 


[17,18,19,20,21].  Neural  network  models  have  also  been 
used  for  optimizing  cutting  parameters  of  taming 
operations  [22]. 

3.0  PROCESS  PLANNING:  A  PATTERN 
RECOGNITION  PROBLEM 

Pattern  Recognition  (PR)  can  be  characterized  as  an 
information  mapping  process  taking  place  over  a  set  of 
(metric/non-metric)  spaces  [14].  An  abstract  view  of  the 
PR  description  problem  is  shown  in  Figure  2.  A 
relationship  G,  stands  as  a  mapping  between  class- 
membership  space,  C,  and  a  pattern  space,  P.  Each  Class 
Wj  corresponds  to  a  subset  of  patterns  in  the  pattern  space, 
where  the  Ith  pattern  is  denoted  as  pv  These  pattern  spaces 
may  overlap  allowing  patterns  from  different  classes  to 
share  the  same  attributes.  Figure  2  shows  mapping  from 
the  pattern  space  to  an  observation  space,  with  feature 
patterns  mv  via  the  relationship  M. 


u 


Figure  2.  PR:  mapping  in  an  abstract  form 


All  patterns  pertaining  to  the  different  spaces  are 
represented  in  an  abstract  form  by  vectors,  and  any  given 
space  may  assume  a  specific  dimension,  n,  and  thus  be 
represented  generically  as  Rn.  Depending  on  the  nature  of 
the  problem  (the  pattern  generating  environment)  and  on 
how  well  it  can  be  structured,  a  suitable  PR  technique  can 
be  applied.  For  example,  statistical  PR  can  be  used  when 
patterns  come  from  a  statistically  characterized 
environment  However,  quantitative  precise  descriptions  of 
some  complex  decision  problems  may  be  largely  unknown 
and  a  detailed  characterization  may  be  hopeless.  A  viable 
alternative  is  to  treat  the  problem  from  an  input/output  or 
"black  box'  viewpoint  [14].  A  Neural  pattern  recognizer  is 
a  good  example  of  black  box  systems.  Because  process 
planning  (PP)  is  such  an  ambiguous  and  ill-structured 
problem  (at  least  in  certain  aspects),  it  seems  reasonable  to 
approach  PP  using  neural  network  PR  techniques. 

Process  planning  can  be  cast  as  a  pattern  recognition 
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problem;  that  is  a  set  of  mappings  from  one  space(s)  to 
another  space(s).  Process  planning  for  CNC  machining  is 
a  multivariable  input/output  process  considering,  among 
others,  part  dimensions,  material,  surface  finish,  fixture 
rigidity,  tool  material,  spindle  speed,  feed  rate,  depth  of 
cut,  etc.. 


variables  into  a  combined  input  effect,  with  no  major 
emphasis  on  any  particular  input  variable.  The  concept 
permits  capturing  inexact  relationships  between  individual 
inputs  and  outputs.  Figure  4  is  a  schematic  view  of  a  black 
box'  process  planner.  Here,  the  output  decision  variables 
are  the  entry  strategy,  and  cutting  tool  material. 


The  input  decision  variables  pertaining  to  the  part  to  be 
manufactured  are:  feature  type,  feature  perimeter,  length, 
width,  depth,  comer  radius,  material,  tolerance,  and  finish 
as  shown  in  Figure  3.  Other  input  decision  variables  are 
machinist  preference  and  delivery  requirement  of  the  part. 
Each  of  the  output  decision  variables  depends  on  a  mixture 
of  the  input  variables,  and  possibly  on  some  other  input 
variables.  The  mixture  of  the  input  variables  forms  the 
input  space,  or  in  pattern  recognition  terminology  the 
observation  space.  The  output  variables  forms  the  output 
space,  or  likewise  the  decision  space.  The  idea  of  the 
black  box'  is  to  collapsethe  effects  of  individual  input 


Figure  4.  A  Black  Box  View:  Tool  Type  Selector 


4.0  A  NEURAL  NETWORK  PROCESS 
PLANNER 

The  research  methodology  adopted  is  aimed  at  exploring 
the  utilization  of  neural  network  models  as  supportive 
decision-making  tools  in  machining  process  planning  and 
as  a  knowledge  store.  To  carry  out  this  objective,  two 
neural  network  architectures  have  been  implemented  and 
tested  to  make  decisions  on  tool  type  material  and  entry 
strategy  for  a  CNC  milling  operation.  For  each  network  a 
necessary  subset  of  milling  process  parameters  have  been 
identified  as  shown  in  Tables  la,  lb. 

Table  la. 


Network  1. 

Output  decision  variable:  Tool  material  type 


Input  variables: 

Values  (classes)  of  output: 

Feature  type 

Depth  of  cut 

Carbide 

Part  material  hardness 

Ceramic 

Machinability 

High  speed  steel 

Fixture  rigidity 

M/C  max  rpm,  feed, 
horsepower 

Table  lb. 

Network  2. 

Output  decision  variable:  Tool  Entry  strategy 


Input  variables: 

Values  (classes)  of  output* 

Feature  type,  #  of  Islands, 
smallest  gap 

Feature  Width,  Depth,  Length, 
Perimeter 

Plunge 

Part  material  hardness 

Free  entry 

Fixture  rigidity 

Drillhole 

M/C  max  rpm,  feed, 
horsepower 

Ramp 

Machinability,  Comer  Radii, 
Surrounding  geometry 

The  development  process  of  the  network  consists  of  two 
main  steps.  First,  a  preliminary  data  set  was  obtained  for 
each  decision  variable  considered,  and  an  initial  network 
architecture  was  selected  for  training  to  make  sure  that  a 
neural  model  of  the  problem  can  be  generated  (for 


preliminary  design,  the  number  of  hidden  nodes  was  set 
equal  half  the  input  pattern  dimension,  along  with  a 
learning  rate  of  .1  and  a  momentum  factor  of  .9).  Second, 
using  a  refined  set  of  examples,  a  sensitivity  analysis 
procedure  was  conducted  to  determine  optimal  network 
design  parameters.  The  data  collection  phase  was  carried 
out  at  the  Integrated  Manufacturing  Systems  Laboratory  at 
New  Mexico  State  University  from  interviews  with 
machinists.  Forty  five  training  examples  were  constructed 
for  each  decision  output  variable  (i.e.  about  10-15  examples 
for  every  output  class)  which  provided  a  good  level  of 
discrimination  between  classes. 

Two  pre-processing  steps  took  place  before  presenting  data 
to  the  neural  network.  First,  as  shown  in  Table  1,  input 
variables  range  in  type  and  value.  Some  variables  can 
intrinsically  have  numeric  values  (e.g.  depth,  hardness), 
while  others  (e.g.  machinability  and  fixture  rigidity)  can 
best  be  described  in  linguistic  form,  such  as  high,  medium 
or  low.  Since  neural  nets  can  only  operate  on  numeric 
data,  linguistic  variables  have  been  quantified  so  that  they 
have  numeric  values  between  0  and  1.  Second,  the  values 
of  variables  can  range  from  ,0001's  (for  feed,  tolerance)  to 
lOk's  (for  machine  tpm).  During  training  of  a  network,  a 
wide  range  of  values  can  introduce  a  large  amount  of  bias 
in  the  values  of  connection  weights  (i.e  input  variables  of 
high  values  tend  to  be  over-emphasized).  To  alleviate  this 
problem,  all  input  variables  have  been  normalized  so  that 
their  values  fall  in  the  range  0  and  1.  This  range  is 
actually  a  preferable  choice  for  network  operation;  as  the 
activation  function  (unipolar  sigmoid)  assumes  the  samp 
range,  thus  resulting  in  moderate  values  of  the  weights. 
When  using  the  sigmoid  activation  function,  a  node  cannot 
have  an  output  of  1  or  0  unless  a  connection  weight  has  an 
infinitely  positive  or  negative  value  [23],  Therefore,  .9  and 
.1  have  been  used  as  cutoff  values  for  classification 
During  the  learning  phase,  the  network  learns  all  possible 
classes  and  corresponding  regions,  and  consequently, 
during  the  testing  phase,  the  network  recognizes  a  new 
input  case  (vector)  by  assigning  it  to  the  closest  decision 
class.  -New  cases  are  represented  to  the  network  as  input 
vectors  and  the  network  will  recommend  the  closest 
possible  solution  in  terms  of  the  output  variable. 

5.0  NETWORK  TRAINING  AND  RESULTS 

This  section  presents  results  of  network  analysis  for  the  two 
networks  under  study;  namely  the  tool  material  network 
and  the  entry  strategy  network.  Throughout  the  analytic 
training  was  carried  out  using  a  two-layer  network  (one 
hidden  node)  with  a  momentum  factor  of  .9,  and 
convergence  target  error  of  5%  (that  is,  the  output 
generated  by  the  network  was  required  to  be  within  5%  of 


the  desired  values).  To  reduce  the  effect  of  random  initial 
weights  selected  by  the  simulation  program,  each 
simulation  run  presented  here  is  an  average  of  two  run 
repetitions.  Performance  measures  were  monitored  for 
various  values  of  the  learning  constant  and  number  of 
hidden  nodes. 


For  the  tool  material  network.  Table  2  shows  the  results  of 
convergence  cycles  vs.  number  of  hidden  nodes  at  a 
learning  rate  of  .1.  Generally  the  number  of  convergence 
cycles  increases  with  increase  the  number  of  hidden  nodes. 
Hidden  nodes  provide  the  mechanism  for  representing  the 
knowledge,  and  so  it  would  be  reasonable  to  include  as 
many  as  possible  without  causing  excessive  or  unnecessary 
computation.  Thus,  for  this  network  the  most  appropriate 
number  of  hidden  nodes  is  16  as  indicated  in  Table  2.. 


Table  2.  Tool  material  network: 


network 

U 

#  of  hidden 

neurons 

#of 

converg. 

1. 

5 

1442 

2. 

8 

2161 

3. 

11 

1461 

4. 

13 

1156 

5. 

16  •optimal 

817 

6. 

20 

1301 

7. 

25 

1638 

Table  3  shows  the  results  of  the  convergence  cycles  vs.  the 
learning  rate.  As  expected  there  is  a  general  increasing 
trend  of  the  convergence  cycles  as  the  learning  rate 
becomes  smaller. 


Table  3.  Tool  material  network: 
convergence  cycles  vs.  learning  rate 
(  hidden  neurons=16) _ 


learning  constant(r|) 

convergence 

cycles 

0.10 

1034 

0.20 

513 

0.35 

691 

0.40 

577 

A  0.2  learning  rate  was  used  as  it  provided  the  best 
convergence  rate  For  the  entry  strategy  network,  the 
learning  rate  was  varied  for  different  values  of  the  number 
of  hidden  nodes  as  shown  in  Table  4.  The  nmr>tv»r  0f 
cycles  decreased  as  the  learning  rate  increased  for  different 
number  of  hidden  nodes  except  where  the  number  of 
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hidden  nodes  is  16.  This  network  did  not  converge  at  any 
learning  rate  when  the  number  of  hidden  nodes  was  25;  it 
kept  oscillating  with  an  average  error  value  of  61.  So  for 
this  network  the  number  of  hidden  used  was  20  along  with 
a  0.3  learning  rate. 


Table  4.  Entry  strategy  network:  convergence 
cycles  vs.  hidden  nodes  &  learning  constant 


Hidden  nodes 

8 

16 

20 

25 

Learning  constant 

.1 

1973 

1254 

1357 

• 

.2 

1425 

890 

749 

♦ 

.3 

!  824 

950 

639 

* 

*  did  not  converge 


A  second  important  performance  measure  of  network 
operation  is  the  percentage  of  classification  errors.  The 
networks  designed  were  tested  first,  using  the  45  training 
examples,  and  second,  using  a  new  set  of  20  examples  that 
were  not  used  for  training.  For  the  tool  material  network, 
when  tested  using  the  training  examples,  the  performance 
of  the  networks  exceeded  the  expectations;  as  it  was  able  to 
recognize  at  least  37  of  the  40  training  examples.  This 
result  supports  the  fact  that  a  neural  model  can  be 
synthesized  to  assist  satisfactorily  in  the  process  planning 
decision-making  process.  When  used  to  classify  the  new 
patterns,  the  network  correct  classification  rate  ranges  from 
69%  for  a  learning  constant  of  .30  and  16  hidden  nodes  to 
92%  for  a  learning  constant  of  0.10  and  the  same  number 
of  hidden  nodes.  These  results  are  summarized  in  Table  5 
as  averages  of  two  trials.  Using  this  performance  measure 
it  seemed  that  a  .1  learning  rate  is  preferable  to  .2  and  .3 
as  the  average  correct  classification  rate  is  the  highest  for  a 
.1  learning  rate.  However,  it  is  at  the  expense  of  increased 
convergence  cycles  and  time  (see  Table  3). 

Similar  analysis  was  conducted  for  the  entry  strategy 
neural  network.  Again  this  network  was  able  to  correctly 
classify  at  least  90%  of  the  examples  used  for  training. 
When  the  network  was  tested  for  new  examples  the 
performance  dropped  down  to  a  best  value  of  74%  .  The 
results  of  the  correct  classification  percentage  of  the 
network  trained  with  a  20  hidden  nodes,  .9  momentum 
factor,  and  5%  target . 

CONCLUSION 

This  paper  has  shown  the  viability  of  utilizing  the  neural 
networks  approach  for  assisting  the  process  planner  or 
CNC  programmer  in  obtaining  machining  set-ups  and 
strategies.  It  has  been  demonstrated  that  uncertainty  and 
Table  5.  Tool  material  network  classification  performance 


hidden 

nodes 

learning 

constant(T|) 

correct 

classification  (%) 

correct 

classification^) 

Training  Examples 

Test  Examples 

5 

.10 

95 

88 

11 

.10 

95 

90 

16 

.10 

100 

92 

5 

.20 

90 

80 

11 

.20 

95 

85 

16 

.20 

97 

90 

5 

.30 

95 

80 

11 

30 

95 

76 

16 

.30 

95 

69 

vagueness  inherent  in  the  minds  of  "experts"  can  be 
handled  using  a  neural  network  approach  within  certain 
limits  of  error.  The  methodology  shows  a  promise  as  a 
technique  of  storing  knowledge  and  being  retrievable  in  a 
rapid  manner.  As  new  manufacturing  strategies  become 
proven  for  industry,  the  neural  network  model  can  be  easily 
retrained  to  handle  updated  information. 

The  optimal  design  of  a  neural  network  architecture  is 
achieved  through  running  simulations  and  in  this  case  the 
"best"  was  sixteen  hidden  nodes  and  0. 1  learning  factor  for 
the  tool  material  network,  and  20  hidden  nodes  with  a  .1 
learning  rate  for  the  entry  strategy  network,  as  measured  by 
the  percentage  of  correct  classification.  During  the  toting 
phase,  the  classification  accuracy  of  the  tool  material 
network  averaged  about  80%  correct  classification  over  a 
range  of  69%  to  92%.  However,  the  average  classification 
accuracy  for  the  entry  strategy  network  ranged  from  43%  to 
74%.  These  results  should  be  encouraging  as  they  provide 
evidence  that  a  neural  network  model  can  represent  some 
aspects  of  process  planning  decision-making. 

At  this  stage,  this  pilot  neural  network  process  planner  can 
enhance  process  planning  by  providing  initial  assistance  in 
the  decision  process,  either  in  a  computer-aided  or  human, 
made  decison-making  environment.  Future  work  is 
planned  in  constructing  more  training  examples  that  are 
comprehensive  and  representative  of  the  decision  cases 
considered.  It  follows  that  these  result  arrays  can  be  a  good 
means  of  storing  the  knowledge  in  a  receivable  format 
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ABSTRACT 

The  paper  presents  results  of  a  study  on  collecting  machining  strategies  for  machining 
assistants  and  process  planning.  These  efforts  are  being  conducted  at  the  NMSU-Integrated 
Manufacturing  Systems  Laboratory  (IMSL).  Goals  of  the  project  aim  at  improving  and 
advancing  the  solicitation,  documentation,  and  automation  of  machining  knowledge/data 
acquisition,  and  integration  with  CAD/CAM/CAE  systems.  This  paper  emphasizes  the 
knowledge  acquisition  phase  of  the  study  utilizing  artificial  neural  networks. 

INTRODUCTION 

Designing  and  developing  knowledge  based  systems  for  Computer  Numerical  Control 
(CNC)  machining  operations  is  a  formidable  task  due  to  the  wide  variety  of  part  designs  and 
the  inability  to  anticipate  local  operating  constraints.  What  works  in  one  place  does  not 
necessarily  work  in  another.  The  steep  learning  curves  and  high  costs  for  implementing  and 
using  CAD/CAM  has  resulted  in  a  very  slow  shift  toward  computerized  machining.  In  order 
to  cut  lead  times  and  reduce  costs  for  operating  CNC  equipment,  systems  are  needed  which 
can  be  easily  tuned  and  adjusted  during  implementation  and  then  continuously  improved. 

Planning  and  developing  Numerical  Control  (CNC)  programs  for  the  production  of 
mechanical  parts  is  an  expensive  and  time-consuming  process  [1].  Part  geometry,  design 
specifications,  local  expertise  and  operating  constraints  form  the  basis  for  the  selection  of 
machining  strategies  which  must  be  translated  (by  one  of  various  methods)  into  specific 
machine  controller  instructions.  A  process  engineer  (domain  expert),  who  traditionally 
performs  these  functions  manually,  routinely  applies  learned  skills  to  prescribe  manufacturing 
operational  steps  and  specific  machining  parameters. 

The  IDEF3  (ICAM  Definition  Language  3)  structured  model  of  the  thought  process 
followed  by  a  CNC  programmer  helps  maintain  a  consistency  throughout  the  machining 
assistants  because  it  can  be  used  to  acquire  and  represent  knowledge,  format  the  decision 
processor,  and  assist  the  user  in  reviewing  the  reasons  for  suggested  methods  after  a  tool  path 
is  built.  One  main  focus  is  to  document,  analyze,  and  classify  various  machining  input 
parameters  and  techniques  accumulated  through  literature  review  and  interviews  with  experts. 

Commercial  CAD/CAM/CAPP  software  do  not  currently  facilitate  the  automatic 
accumulation  of  design  and  manufacturing  data  produced  throughout  the  planning  and 
production  of  parts.  Today’s  CNC  machine  tool  programming  methods  are  inefficient  and  do 
not  utilize  the  product  model  concept.  Plans  are  often  produced  and  executed  on  the  fly, 
modified  on  only  the  final  hard  copy,  and  inaccessible  by  people  responsible  for  job  costing. 
Furthermore,  the  intelligence  and  know-how  to  manufacture  and  track  parts  resides  in  wide 
base  of  skilled  crafts  people,  process  engineers  and  manufacturing  experts.  Advanced  NC 
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programming  systems  will  be  necessary  to  exercise  the  use  of  a  product  model.  The  Computer 
Aided  Acquisition  and  Logistics  Support  (CALS)  and  Product  Definition  Exchange 
Specification  (PDES)  /  Standard  for  the  Exchange  of  Product  Data  (STEP)  initiatives  are 
currently  addressing  these  very  issues. 

MACHINING  STRATEGY 

The  feature  being  machined,  available  tooling,  machine  accuracy,  and  setup  rigidity  can 
influence  one’s  decision  drastically.  Many  times,  duplicate  tools  are  used  to  both  rough  and 
finish  (especially  when  considering  a  large  number  of  parts)  so  that  results  can  be  repeated 
consistently.  Setup  decisions  are  based  upon  the  configuration  of  the  part,  the  material  and  the 
tooling  available.  Setup  choices  for  a  particular  feature  can  include  tool  selection, 
depth/width/cut,  speed  and  feed. 

Features.  The  concept  of  "features"  plays  an  important  role  in  linking  design  to 
manufacturing.  CAM-I’s  Current  Status  of  Features  Technology  defines  features  used  in 
process  planning  for  manufacturing  as  "shapes  and  technological  attributes  associated  with 
manufacturing  operations  and  tools  [1]."  From  a  machinist’s  viewpoint,  the  first  step  in 
developing  a  machining  plan  is  to  break  the  part  into  features.  Each  feature  is  characterized 
as  a  structural  entity  whose  attributes  specify  lower-level  geometry,  topology,  tolerance, 
surface  finishes,  etc. 

Tool  Selection.  Roughing  and  Finishing  tools  are  capable  of  cutting  in  a  wide  variety 
of  materials  and  orientations.  The  tool  selection  process  can  be  separated  into  a  series  of 
choices  between  materials  and  geometric  attributes  associated  with  grooving  tools.  The  choices 
include  a  tool  material,  tool  width,  tool  shape,  tool  length,  and  comer  radius. 

Deep  Hole  Drilling.  Drill  geometry  is  such  that  abrupt  retraction  out  of  a  deep  hole 
drilling  process  can  result  in  tool  breakage.  To  alleviate  this  problem,  programming  dwell  for 
two  spindle  revolutions  will  let  the  chip  thin  to  a  zero  thickness  and  allow  a  clean  retraction. 
This  type  of  knowledge  is  seldom  incorporated  in  a  standard  drill  cycle  on  a  controller  and 
must  be  added  through  programming  mehods. 

Etc.  The  knowledge  parameters  need  to  be  evaluated  by  experts  and  incorporated  into 
a  corporate  data  base. 

KNOWLEDGE  ACQUISITION  AND  REPRESENTATION 

A  machinist  in  charge  of  designing  a  tailored  process  plan  is  always  faced  with  the 
problem  of  selecting  from  a  vast  variety  of  possible  combination  values  of  machining 
parameters,  such  as  speed,  feed,  depth  of  cut,  tool  material  and  size,  tool  entry  strategy, 
cutting  strategy,... etc.  "It  is  very  difficult  to  construct  general,  and  yet  simple,  rules  capable 
of  handling  all  or  most  machining  cases,"  said  Joe  Alejandrez,  a  20-year-experience  machinist 
form  the  Physical  Science  Laboratory  (PSL).  It  is  difficult  to  obtain  the  assistance  of  skilled 
machinists  when  attempting  to  build  an  automated  programming  system.  Information  is 
difficult  to  communicate  verbally  and  formalize  for  the  purpose  of  designing  a  knowledge  base. 
A  key  factor  in  the  effort  lies  in  the  re-training  of  shop-floor  workers.  If  some  decisions  they 
normally  make  are  being  replaced  by  computer  programs,  then  new  knowledge  must  be  given 
in  return.  The  services  of  an  experienced  lathe  operator  (Ken  Davis,  17  years  between 
FreightLiner  and  BOEING)  for  two  months,  greatly  contributed  to  the  project  success. 

Automatic  creation  of  NC  programs  requires;  1)  user-interaction  -  to  gather  location 
specific  operating  information,  2)  interrogation  of  the  features  geometric  description  within  the 
CAD  database,  3)  a  file  based  communication  link  to  a  decision  processor  and,  4)  An 
applications  interface  language  capable  of  building  tool  paths  from  user  generated  programs. 
Very  few  commercial  CAD/CAM  systems  provide  the  user  with  enough  power  to  accomplish 
these  tasks.  Many  CAPP  systems  have  had  to  build  their  own  primitive  CAD  system  for  input 
or  used  symbolic'  languages  to  output  the  parts  geometric  description.  Obvious  extensions  to 
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the  automated  process  planner  described  here  include  automatic  feature  recognition,  enhanced 
decision  processing  methods,  part  program  optimization  routines,  and  Group  Technology  (GT) 
coding  schemes  for  individual  rotational  features.  For  this  project  we  utilized  ANVIL5000 
CADD/CAM/CAE  software  for  its  application  interface  and  integrated  capabilities. 

ARTIFICIAL  NEURAL  NETWORKS  (ANNs) 

ANNs  are  parallel  distributed  processing  architectures  composed  of  a  large  number  of 
small  interacting  elements  that  are  massively  interconnected.  Each  of  these  elements  sends 
excitatory  and  inhibitory  signals  to  other  units  that  in  turn  update  their  behaviors  based  on  these 
received  messages.  ANNs  emulate  the  functionality  of  the  human  brain  that  implicitly  stores 
knowledge  in  the  interconnection  weights,  and  not  in  the  neurons  themselves  [3].  A  typical 
network  consists  of  a  set  of  processing  elements  (PE)  grouped  hierarchically  in  layers  and 
interconnected  in  some  fashion.  These  PEs  sum  N  weighted  inputs  and  transfer  that  outcome 
through  a  mathematical  function.  One  of  the  most  popular  neural  paradigms  is  the  multilayer 
perceptron  trained  using  the  Error  Back  Propagation  Training  Algorithm  (EBPTA)  [2]. 

Neural  network  models  have  key  advantages  over  rule-based  systems.  First,  the 
learning  process,  or  the  knowledge  acquisition  process,  takes  place  by  presenting  only  sample 
machining  cases  to  the  network.  Learning  occurs  through  modifying  the  connection  weights 
between  network  elements  (by  a  learning  algorithm)  to  model  the  representation  of  the  cases 
[4].  Knowledge  is  thus  stored  implicitly  in  the  final  interconnections  values.  Second,  because 
training  examples  need  not  be  exhaustive,  it  is  conceivable  that  neural  network  methodology 
could  ease  the  knowledge  acquisition  bottleneck  that  is  hampering  the  creation  of  rule-based 
systems  [5].  Finally,  when  utilized  as  a  decision  processor,  an  artificial  neural  network  has 
inherent  parallelism  resulting  in  fast  computation,  robustness  or  error  resistance,  and 
adaptiveness  or  generalization. 

A  NEW  KNOWLEDGE  ACQUISITION  METHODOLOGY 

This  methodology  should  satisfy  a  number  of  basic  requirements.  First,  it  must  reduce 
the  burden  associated  with  developing  rule-based  systems  by  avoiding  constructing  general 
rules,  because,  as  explained  earlier,  experts  are  not  good  at  generalizing  solutions.  Second, 
this  methodology  must  ensure  that  knowledge  be  cast  in  a  form  usable  by  neural  systems; 
forms  of  rules  are  excluded.  What  is  required  by  a  neural  system  is  a  set  of  training  examples, 
where  each  example  represents  a  real  machining  case  described  by  a  vector  of  numeric  values 
of  process  variables  encoded  in  a  convenient  from.  Third,  consequently,  the  new  methodology 
must  employ  effective  instrumental  knowledge  gathering  forms;  and  since  no  general  inferential 
statements  are  required  at  this  stage,  the  knowledge  gathering  forms  can  be  mere  data 
collection  forms.  The  basic  idea  underlying  this  procedure  is  to  enable  the  user  to  provide 
relevant  training  examples  only,  and  the  neural  network,  through  the  learning  algorithm,  will 
implicitly  derive  the  IF— THEN  rules. 

Having  decided  on  the  output  Process  Planning  (PP)  decision  variables  to  be  automated, 
the  next  step  was  to  find  all  related  PP  input  variables  that  could  affect  the  selection  of  the 
output  variable.  This  process  took  place  in  an  iterative  manner  as  follows: 

-  Initially,  a  long  list  of  possible  input  variables  was  constructed  that  could  collectively 
drive  the  selection  process  of  all  output  variables  simultaneously. 

-  It  was  found  that  not  all  selected  input  variables  do  contribute  to  the  selection  of  a 
given  output  decision  value. 

-  This  revisited  understanding  revealed  that  some  of  the  input  variables  had  to  be 
deleted  as  they  did  not  affect  selecting  the  value  of  the  output  parameter.  At  this  point,  our 
forms  were  judged  to  provide  ’good’  instruments  to  start  collecting  data.  Each  form  deals  with 
one  output  variables,  and  only  relevant  non-redundant  input  variables. 

-  During  data  collection,  our  expert  machinists  tended  to  emphasize  some  input 
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variables  in  certain  cases,  while  giving  less  weight  to  the  same  variables  when  dealing  with 
other  cases.  Weights  were  assigned  to  variables  according  to  0-10  scale  in  an  increasing  order 
of  merit. 

-  Using  the  data  forms,  a  good  number  of  examples  were  constructed  (about  50)  for 
output  decision.  At  this  stage,  it  was  observed  that  some  input  variables  constantly  scored  low 
weight  rating  (e.g.  2,  3).  Those  variables  were  deleted  form  the  forms  as  they  were 
considered  to  represent  noise.  This  also  reduced  the  dimensionality  of  the  input  vector. 

In  light  of  the  above  requirements,  a  neural-based  machining  knowledge  acquisition 
technique  is  designed  and  implemented  during  the  course  of  developing  a  neural  model  as  a 
supportive  decision  making  tool  in  machining  process  planning. 

CONCLUSIONS 

Computerized  machining  processes  are  slowly  becoming  commonplace  in  both  large  and 
small  companies.  Machine  operators  and  programmers  must  be  included  in  the  design  and 
development  of  such  applications.  Many  systems  are  dependent  on  a  skilled  computer 
programmer  to  tune  the  knowledge  and  rules  embedded  in,  what  is  often,  very  cryptic  code. 
Creative  inference  techniques  and  clever  knowledge  structures  cannot  solve  problems  that  are 
not  well  understood  by  the  applications’  designers.  The  straightforward  use  of  data  collection 
techniques  that  ANNs  employ  can  be  successfully  used  in  intelligent  machining  assistants. 
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ABSTRACT: 

This  paper  presents  results  from  an  Army  Research  Office  grant 
developing  knowledge  based  systems  for  Computer  Numerical  Control 
(CNC)  machining  operations.  One  of  the  areas  of  development 
involves  knowledge/data  acquisition  for  input  parameters  to  the 
knowledge  processor.  In  our  work  we  chose  artificial  neural  networks 
as  a  mechanism  for  capturing  data  and  then  utilized  the  ability  of  the 
networks  to  determine  certain  process  parameters  for  automating 
specific  tasks  in  CNC  programming.  Artificial  neural  networks  are  well 
adapted  to  this  task  because  they  can  be  continuously  improved. 

INTRODUCTION 

An  increasing  need  for  the  rapid  production  of  small  batches  and  prototype  parts  will 
drive  manufacturers  to  seek  new  ways  to  decrease  lead  times  and  costs  associated  with 
Computer  Numerical  Control  (CNC)  programming  and  production  planning.  Planning 
and  developing  CNC  programs  for  the  production  of  mechanical  parts  can  be  an 
expensive  and  time-consuming  process.  A  process  engineer  (domain  expert),  who 
traditionally  performs  these  functions  routinely  applies  learned  skills  to  prescribe 
manufacturing  operation  steps  and  specific  machining  parameters.  One  author 
summarizes  the  problem  as  follows: 

This  knowledge  is  better  described  as  many  differently  weighted  pieces 
of  advice  than  as  a  small  set  of  constraints  to  be  satisfied.  Such 
planning  involves  taking  into  account  a  great  variety  of  both 
technological  rules  and  economic  considerations.  Furthermore,  they 
are  not  absolute.  They  are  more  preferences,  between  which 
compromises  may  be  necessary.  In  addition,  they  may  differ  somewhat 
from  one  company  to  another.  In  fact,  they  represent  the  experience 
and  know-how  of  engineers.  An  immediate  consequence  is  that  the 
planning  of  machining  sequences  definitely  does  not  have  a  unique 
solution.  (Descotte  and  Latombe,  1989) 

Problem  Identification 

Commercial  CAD/CAM/CAPP  software  do  not  currently  facilitate  the  automatic 
accumulation  of  design  and  manufacturing  data  produced  throughout  the  planning  and 
production  of  parts.  Today's  CNC  machine  tool  programming  methods  are  often 


1  -  24 


inefficient  and  do  not  utilize  the  product  model  concept.  Plans  are  often  produced  and 
executed  on  the  fly,  modified  on  only  the  final  hard  copy,  and  inaccessible  by  people 
responsible  for  job  costing.  Furthermore,  the  intelligence  and  know-how  to  manufacture 
and  track  parts  resides  in  a  wide  base  of  skilled  crafts-people,  process  engineers  and 
manufacturing  experts. 

Commercial  CAD/CAM  gets  limited  acceptance  among  CNC  machine  operators 
and  CNC  programmers  who  perceive  that  their  jobs  are  being  replaced  by  computers. 
And  of  course  the  systems  available  for  efficiently  capturing  the  knowledge  of  the  'skilled' 
are  for  the  most  part  still  in  a  state  of  development. 

Artificial  Neural  Networks 

There  has  been  an  increase  of  interest  in  brain  style  computing'  in  terms  of  artificial  neural 
networks  (ANNs).  ANNs  are  parallel  distributed  processing  architectures  composed  of  a 
huge  number  of  small  interacting  elements  that  are  massively  interconnected.  Each  of 
these  elements  sends  excitatory  and  inhibitory  signals  to  other  units  that  in  turn  update 
their  behaviors  based  on  these  received  messages.  ANNs  emulate  the  functionality  of  the 
human  brain  that  implicitly  stores  knowledge  in  the  interconnection  weights,  and  not  in 
the  neurons  themselves  (Rumelhart  and  McClelland,  1986).  Learning  is  thus  achieved  by 
modifying  the  connection  weights  between  elements.  Feedforward  neural  nets  acquire 
knowledge  through  training,  where  by  repeatedly  presenting  sample  cases  to  the  net,  its 
interconnection  weights  change  accordingly  to  model  the  representation  of  the  cases 
(Shalkof,  1992).  Knowledge  is  stored  in  the  final  interconnections  values.  As  such  neural 
network  methodology  would  ease  the  knowledge  acquisition  bottleneck  that  is  hampering 
the  creation  of  expert  system  (Zurada,  1992).  The  key  to  a  neural  network's  power  is  the 
inherent  parallelism  resulting  in  fast  computation,  robustness  or  error  resistance,  and 
adaptiveness  or  generalization.  Atypical  network  consists  of  a  set  of  processing  elements 
(PE)  grouped  hierarchically  in  layers  and  interconnected  in  some  fashion.  These  PEs  sum 
N  weighted  inputs  and  transfer  that  outcome  through  a  mathematical  function.  One  of  the 
most  popular  neural  paradigms,  and  the  one  that  has  been  used  in  this  project,  is  the 
multilayer  perceptron  trained  using  the  Error  Back  Propagation  Training  Algorithm 
(EBPTA)  (Dayhoff,  1990).  Within  the  manufacturing  environment  ANNs  have  been 
applied  to  quality  control,  process  control,  part-family  classification  (Martinez,  Smith,  and 
Bidanda,  1993;  Telasco,  1993;  Udo,  1992;  Cook  and  Shannon,  1992;  Charraprty  and 
Uptal,  1993)  Neural  network  models  have  been  also  used  for  optimizing  cutting 
parameters  of  turning  operations  (Wang,  1993). 

Neural  network  models  have  key  advantages  over  rule-based  systems.  First,  the 
learning  process,  or  the  knowledge  acquisition  process,  takes  place  by  presenting  only 
sample  (machining)  cases  to  the  network.  Learning  occurs  through  modifying  the 
connection  weights  between  network  elements  (by  a  learning  algorithm)  to  model  the 
representation  of  the  cases  (Shalkof,  1992).  Knowledge  is  thus  stored  implicitly  in  the 
final  interconnection  values.  Second,  because  training  examples  need  not  be  exhaustive,  it 
is  conceivable  that  neural  network  methodology  could  ease  the  knowledge  acquisition 
bottleneck  that  is  hampering  the  creation  of  rule-based  systems  (Zurada,  1992).  Finally, 
when  utilized  as  a  decision  processor,  an  artificial  neural  network  has  inherent  parallelism 
resulting  in  fast  computation,  robustness  or  error  resistance,  and  adaptiveness  or 
generalization. 
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Computer  Aided  Process  Planning 

Computer  Aided  Process  and  operations  Planning  (CAPP)  systems  attempt  to  automate 
the  decision  making  process  necessary  to  map  design  specifications  on  the  company 
specific  manufacturing  environment.  Many  CAPP  systems  utilize  some  form  of  Artificial 
Intelligence  (AI)  to  model  the  human  interaction  previously  required  by  the  task. 
CAD/CAM  software  and  databases  are  used  to  supply  product  descriptions  and 
information  management,  respectively.  Material  removal  operations  provide  a  rich 
environment  for  the  implementation  of  intelligent  applications.  Much  of  the  current  work 
in  this  field  concentrates  on  the  AI  technique  used  to  process  the  knowledge  and/or  the 
solid  model  design  features  utilized  by  the  engineer  to  initially  create  the  part  (Anderson 
and  Chang,  1990;  Chung,  Patel,  and  Cook,  1990;  Joshi,  Vissa,  and  Chang,  1989;  Phillips 
and  Mouleeswaran,  1985). 

Manufacturing  Assistants 

Manufacturing  Assistants  are  systems  which  emphasize  the  specific  decisions  and 
associated  data  which  must  be  considered  when  selecting  tools,  cutting  speeds,  tool 
motions,  etc.  in  a  manufacturing  or  machining  environment.  As  discussed  here,  the  term 
machining  is  defined  as  the  removal  of  unwanted  material from  a  workpiece  in  chip  form 
so  as  to  obtain  a  completed  product  that  meets  size,  shape  and  finish  characteristics 
specified  by  the  designer.  The  following  factors  contribute  to  the  need  for  improved 
methods  to  manage  the  manufacture  of  machined  parts:  advanced  machine  tool  and 
computer  technology,  higher  quality  and  stricter  tolerance  and  finish  requirements,  new 
materials  such  as  plastics,  composites,  and  exotic  alloys,  smaller  batch  sizes  and  prototype 
parts,  more  complex  part  descriptions,  and  a  more  competitive  marketplace. 

The  Turning  Assistant 

The  Turning  Assistant  (TA)  is  a  direct  outcome  of  an  ARO  project  which  automatically 
creates  plans  for  Computer  Numerical  Control  (CNC)  lathe  operations.  Starting  with  a 
drawing  and  a  set  of  predefined  machines  and  cutting  tools,  the  TA  interactively  defines 
and  produces  code  for  a  variety  of  features  such  as  profiles,  grooves,  and  threads.  The 
TA  focuses  on  the  formalization  of  domain  knowledge,  providing  document  support,  and 
reducing  the  costs  and  lead  times  associated  with  the  planning  function.  For  advanced 
systems  to  be  usable  in  shops  which  currently  dominate  small  batch  production  of 
machined  parts,  flexible  tools  for  fitting  the  application  to  the  environment  must  be  made 
available. 

The  Turning  Assistant  is  a  system  that  is  based  on  previous  work  providing 
automated  programming  for  a  variety  of  milled  features.  The  "Milling  Assistant"  (MA) 
(Burd,  1989)  evolved  from  an  Expert  System  based  decision  processor  to  a  Case-Based 
Reasoning  (CBR)  system  to  automate  the  decision  process  required  for  machining  miller) 
and  point-to-point  features  on  CNC  machine  tools. 

PRODUCT  MODELS 

The  complete  electronic  product  definition  for  a  manufactured  part  consists  of  all  the 
information  related  to  designing,  producing,  inspecting,  and  managing  the  manufacture  of 
that  part.  The  data  provided  by  the  product  model  must  be  accessible,  well  organized, 
and  conform  to  standards  prevalent  in  both  industry  and  government.  The  Computer 
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Aided  Acquisition  and  Logistics  Support  (CALS)  and  Product  Definition  Exchange 
Specification  (PDES)  /  Standard  for  the  Exchange  of  Product  Data  (STEP)  initiatives  are 
currently  addressing  these  very  issues. 

Knowledge  processor 

Numerous  systems  can  be  utilized  for  decision  processing.  The  TA  decision  processor  is 
written  in  GRAPL-IV,  a  FORTRAN  77  based  application  interface  within  the  ANVIL- 
5000  CAD/CAM  system  and  is  used  to  determine  machining  strategies  and  effectively 
execute  the  operation.  Another  is  based  on  a  series  of  ANNs  written  in  C,  which  can  be 
called  as  needed  from  the  application  interface  within  the  ANVIL-5000  CAD/CAM 
system.  Another  set  of  C  programs  can  make  calls  to  MetCAPP-IH  or  MetCAPP-IV  to 
acquire  necessary  machining  parameters  and  cutting  strategies. 

KNOWLEDGE  ACQUISITION  AND  REPRESENTATION 

The  most  important  and  difficult  aspect  of  this  research  is  to  develop  effective  tools  with 
which  to  elicit  knowledge  from  domain  experts.  The  skills  obtained  through  years  of 
observation,  training,  improvising,  and  trial-and-error  are  not  easy  to  explain  verbally, 
document,  or  formalize  for  the  purpose  of  building  automated  CNC  programming 
systems.  A  significant  amount  of  time  must  be  spent  in  familiarizing  oneself  with  the  field 
and  terminology  before  attempting  to  solicit  strategies  directly  from  the  CNC 
programmers  and  machinists. 

ANNs  have  a  distinct  advantage  in  speed,  development  and  knowledge  updating 
(hence  learning).  Automatic  creation  of  CNC  programs  requires:  1)  user-interaction  --  to 
gather  location  specific  operating  information,  2)  interrogation  of  the  features  geometric 
description  within  the  CAD  database,  3)  a  file  based  communication  link  to  a  decision 
processor  and,  4)  An  applications  interface  language  capable  of  building  tool  paths  from 
user  generated  programs.  Very  few  commercial  CAD/CAM  systems  provide  the  user 
with  enough  power  to  accomplish  these  tasks. 

Representing  the  Knowledge 

This  research  helped  to  develop  knowledge  elicitation  and  representation  tools  that  can  be 
used  to  formalize  the  expertise  gained  through  machining  strategy  meetings,  literature 
review,  and  programmer  feedback.  It  includes  the  design  of  questionnaires  that  can  be 
used  to  document  strategies  in  a  set  format  during  interviews  or  literature  review  and  data 
collection  forms  for  training  ANNs  (see  Table  1). 


TABLE  1:  ANN  TOOL  TYPE  DATA  COLLECTION  FORM 


Input  Decision  Variables 

Output  Decision  Variables 

★ 

Variable  Name 

Value 

Weight  Factor 

The  correct  tool  material: 

1 

Part  (stock)  material 

0 

2 

4 

6 

8 

10 

1.  High  Speed  Steel 

2 

Part  material  bH 

0 

2 

4 

6 

8 

10 

2.  Carbide 

3 

Material  Machinability 

0 

2 

4 

6 

8 

10 

3.  Ceramic 

4 

Feature  Depth 

0 

2 

4 

6 

8 

10 

Comments: 
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Artificial  Neural  Networks  as  Knowledge  Acquisition  Tools 

The  adopted  research  methodology  aimed  at  investigating  the  utilization  of  a  neural-based 
knowledge  acquisition  approach.  It  is  based  on  the  operational  mode  of  neural  systems, 
and  taking  advantage  of  our  knowledge  and  experience  in  developing  rule-based  systems 
for  similar  tasks  (i.e.  milling  assistant  (Burd,  1990)).  This  methodology  satisfies  a  number 
of  basic  requirements.  First,  it  must  reduce  the  burden  associated  with  developing  rule- 
based  systems  by  avoiding  constructing  general  rules,  because,  as  explained  earlier, 
experts  are  not  good  at  generalizing  solutions.  Second,  this  methodology  must  ensure 
that  knowledge  be  cast  in  a  form  usable  by  neural  systems;  forms  of  rules  are  excluded. 
What  is  required  by  a  neural  system  is  a  set  of  training  examples,  where  each  example 
represents  a  real  machining  case  described  by  a  vector  of  numeric  values  of  process 
variables  encoded  in  a  convenient  from.  Third,  consequently,  the  new  methodology  must 
employ  effective  instrumental  knowledge  gathering  forms;  and  since  no  general  inferential 
statements  are  required  at  this  stage,  the  knowledge  gathering  forms  can  be  mere  data 
collection  forms 

A  neural  network  paradigm  must  first  be  selected  in  consensus  with  the  application 
at  hand.  In  this  research,  a  supervised  neural  network  model  is  employed  as  a  classifier. 
A  supervised  learning  model  is  selected  as  it  enables  the  expert  to  express  his  judgmental 
opinion  in  a  relative  manner.  This  feature  is  very  essential  because,  in  this  application,  it  is 
not  required  to  group  the  training  examples  into  clusters  as  would  be  the  outcome  if  an 
unsupervised  learning  model  were  used.  Instead,  every  single  training  example  must  be 
explicitly  identified  (by  the  expert)  as  belonging  to  a  specific  class  that  represents  a 
machining  decision.  The  neural  model  is  employed  as  a  classifier  so  that  the  output 
decision  belongs  to  a  class  of  objects  (e.g.  type  of  tool  material:  HSS,  Carbide,  Ceramic), 
or  to  a  range  of  numeric  values  (e.g.  tool  size:  l/4"-l/2",  l/2"-l",  etc.).  This  method 
shows  great  promise  considering  the  variety  of  machining  choices  that  can  satisfy  the 
requirements  of  a  quality  product.  See  Table  2  for  a  representative  example  of  the  ANN 
training  and  testing  results. 
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TABLE  2:  TOOL  MATERIAL  NETWORK  CLASSIFICATION 

PERFORMANCE 


I  Training  Examples 

Testing  Examples  I 

hidden 

learning 

identification 

hidden 

learning 

identification 

nodes 

constant^) 

correct  (%) 

nodes 

constant^) 

correct  (%) 

5 

.20 

90 

5 

.20 

80 

11 

.20 

95 

11 

.20 

85 

16 

.20 

97 

16 

.20 

90  j 
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A  Process  Planning  Technique  Utilizing  Neural 
Networks  to  Determine  CNC  Milling,  Drilling  and 
Turning  Parameters  and  Strategies 


ABSTRACT 

As  related  to  machining,  process  planning  consists  of  a  series  of  tasks 
including  interpretation  of  product  model,  selection  of  machine  tools, 
selection  of  tool  sets,  selection  of  setups,  design  of  fixtures,  selection  of 
machining  sequences,  determination  of  tool  paths,  calculation  of  machining 
parameter,  and  generation  of  NC  programs.  Process  planning  is  an  involved 
activity  that  has  a  large  number  of  input  variables  describing  the  part  to  be 
produced  and  the  production  resources. 

If  a  product  model  existed  that  contained  all  processing  parameters  necessary 
for  the  manufacture  of  the  product  then  a  subset  of  these  would  contain 
information  that  would  need  to  take  place  during  the  process  plan  development. 

Virtually  all  products  require  a  chip  producing  machine  tool  in  some  stage  of 
manufacture  and  much  of  the  chip  producing  is  being  accomplished  through 
computer  numerical  control  (CNC) . 

In  most  cases  during  the  planning  for  the  CNC  machining  operations  it  is 
necessary  to  consult  an  expert.  Whether  that  expert  is  a  machinist,  part 
programmer  or  a  knowledge  base,  the  determination  of  specific  set-up  and  run 
parameters  or  for  strategies  for  the  machining  process  must  be  known  before  a 
good  part  program  can  be  completed.  In  some  cases  this  is  simply  human  stored 
knowledge  and  in  others  it  is  the  additional  inputs  to  the  human  stored 
knowledge.  In  our  work  we  are  focusing  on  the  ability  of  the  human  (and  in 
the  long  run  the  ability  of  a  pattern  recognition  system)  to  identify  a 
feature  that  needs  to  be  machined  such  as  a  pocket,  notch,  slot,  hole, 
shoulder,  knurl,  etc.  Once  identified  that  feature  will  have  parameters  and 
strategies  developed  specific  to  its  nature,  thereby,  taking  the  larger 
problem  and  simplifying  it. 

Most  research  efforts  have  focused  on  utilizing  expert  systems  technology  to 
implement  process  planning  systems.  These  systems  offer  assistance  based  on 
capabilities  ranging  from  fast  retrieval  of  existing  plans  to  generating  plans 
based  on  interpreting  product  model  geometry.  However,  expert  systems,  due 
to  explicit  knowledge  representation,  show  limited  adaptive  capabilities,  and 
they  do  not  tolerate  missing  or  inaccurate  data. 
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To  compensate  for  some  of  these  limitations  and  enhance  the  expert  systems 
approach,  this  paper  describes  a  methodology  of  process  planning  for  CNC 
milling,  drilling  and  turning  based  on  the  concept  of  artificial  neural 
networks . 

First,  process  planning  has  been  cast  as  a  pattern  recognition  problem  and 
viewed  as  a  mapping  function  between  design  features  and  machining  features. 
Second,  a  neural  network  model  is  developed  for  automating  process  planning 
decisions  such  as  cutting  tool  material,  entry  method,  and  cutting  strategies. 
The  viability  of  this  approach  stems  from  the  intrinsic  ability  of  neural  nets 
to  handle  ill-structured  problems,  and  their  striking  ability  of 
generalization;  an  essential  property  that  does  not  smoothly  lend  itself  to 
existing  approaches  like  expert  systems  (ES)  .  Based  on  simulation  results, 
initial  investigation  applied  to  milling  processes  with  different  features, 
promise  a  high  potential  for  the  development  of  a  neural  network-based  process 
planning  system. 
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Abstract 

The  paper  presents  results  of  an  effort  to  utilize  a  commercially  available  process 
planning  database  to  provide  machining  knowledge  as  input  into  a  commercial 
CAD/CAM  package.  A  prototype  system  was  developed  which  utilizes  the  CAD/CAM 
software,  a  knowledge  base  and  the  process  planning  package  through  an  application 
interface.  The  prototype  system  based  upon  a  developed  CAD  drawing  and  basic  initial 
decisions  provides  machining  strategies  through  either  the  knowledge  base  or  the  process 
planning  package.  The  goals  of  the  project  were  to  demonstrate  the  successful  utilization 
of  a  knowledge  base,  a  commercial  process  planning  tool,  and  a  neural  net  for  inputs  into 
a  developed  machining  expert  system. 

Background 

Process  planning  is  defined  by  the  Society  of  Manufacturing  engineers  as  “the 
systematic  determination  of  the  methods  by  which  a  product  is  to  be  manufactured 
economically  and  competitively  .  As  the  average  age  of  aerospace  process  planners  has 
been  estimated  by  private  industry  and  the  Air  Force  to  be  51  to  55  years.  2  The 
replacements  for  these  planners  do  not  have  the  level  of  expertise  and  knowledge  to 
replace  what  will  be  lost  through  retirement  But  even  more  importantly,  computerized 
CAPP  systems  capture  the  expertise  so  that  the  knowledge  is  not  vulnerable  to  attrition. 
The  computer’s  ability  to  store,  retrieve,  and  update  data  en  masse  makes  it  the  ideal  tool 
to  organize  the  technological  data  representing  the  knowledge  of  senior  process  engineers 
in  a  useable  format. 

CAPP  applications  can  be  enhanced  in  three  areas,  CAPP  retrieval,  the  ability  of 
CAPP  systems  to  interface  with  other  business  systems  and  enhanced  communications.  ^ 
Process  planning  may  capture  process  information  and  render  it  useful  through  one  of 
tow  means:  variant  and  generative  process  planning.  Variant  process  planning  relies  on 
a  human  operator  and  a  retrieval  mechanism  to  retrieve  similar  process  plans  and  edit 
them  producing  a  new  variation  of  a  preexisting  process  plan:  hence,  the  name  variant 
process  planning.  Generative  process  planning  represent  knowledge  in  the  form  of 


pg.  8,  Culler,  David,  “The  Turning  Assistant:  Automated  Planning  for  Numerical  Control  T 
Operations”,  1994. 

2 

pg.  121,  Nolen,  James,  Computer-Automated  Process  Planning  for  World-Class  Mannfbrnmnn,  Marcel 
Dekker,  NY,  1989. 

3  pg.  182,  Nolen 
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processing  rules.  These  rules,  often  represented  by  decision  trees,  represent  the 
collective  manufacturing  knowledge  for  a  particular  part  family  or  category  of 
manufacturing  processes.  Once  captured,  they  may  be  used  to  generate  the  required 
sequence  of  processing  steps  for  new  products  within  the  set  of  products  for  which  the 
decision  tree  was  designed. 

Generative  process  plans  have  a  number  of  advantages.  Among  the  major  ones 
are  the  following: 

•  They  rely  less  on  group  technology  code  numbers  since  the  process  log,  usually 
captured  in  a  decision  tree,  often  categorizes  parts  into  families. 

•  Maintenance  and  updating  of  stored  process  plans  are  largely  unnecessary  since  any 
plan  may  be  quickly  regenerated  by  processing  through  the  tree.  Indeed,  many  argue 
that  with  generative  systems,  process  plans  should  not  be  stored  since  if  the  process  is 
changed,  an  out-to-date  process  plan  might  find  its  way  back  into  the  system. 

•  The  process  logic  rules,  however,  must  be  maintained  up  to  date  and  ready  for  use. 
This  provides  the  process  planner  with  the  assurance  that  the  processes  generated  will 
reflect  state-of-the-art  technology.4 

Knowledge  Based  Systems 

The  NMSU  Integrated  Manufacturing  Systems  Laboratory  (IMSL)  demonstrated 
the  feasibility  of  utilizing  expert  or  knowledge  based  systems  in  their  generative  process 
planning  machining  assistants.  An  expert  or  knowledge  based  system  is  described  as 
consisting  of  a  mass  of  information  on  a  particular  subject  (the  knowledge  base)  and  a 
program  which  permits  the  system  to  apply  its  knowledge  to  a  particular  problem.  The 
knowledge  base  consists  of  facts  and  rules.  The  facts  constitute  the  body  of  information 
that  is  widely  shared,  publicly  available,  and  generally  agreed  upon  by  experts  in  the 
field.  The  rules  are  rules  of  good  judgment  gleaned  from  a  human  domain  expert  that 
characterize  expert  level  decision  making  in  the  field. 

The  Milling  Assistant,  initially  developed  by  Bill  Burd  of  Sandia  National  Labs, 
aids  in  the  programming  of  Computer  Numerical  Control  (CNC)  machine  tools  by 
automating  the  decision  process  used  to  create  cutter  paths.  MA  is  made  up  of  a  user 
interface  to  a  commercial  CAD/CAM  system,  ANVIL5K,  analysis  programs  written  in 
GRAPL  and  FORTRAN,  and  a  decision  processor  which  contains  the  rules  and  strategies 
on  machining  features.  The  MA  utilizes  a  Case  Base  Reasoning  (CBR)  system  for 
making  strategic  decisions.  The  user,  based  upon  an  existing  part  geometry,  selects  from 
the  available  cutting  tools,  material,  and  machine  tools  available  in  the  MA.  The  MA, 
using  those  inputs,  generates  an  effective  tool  path  and  then  interacts  further  to  allow  the 
user  to  make  desired  modifications  to  the  proposed  solution. 


4pg.  161,  Nolen 
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Another  IMSL  product.  The  Turning  Assistant  (TA)  developed  by  David  Culler 
ase  upon  die  MA  model  is  a  feature  based  process  planner  for  numerical  control  lathe 
operations.  Again,  a  Fortran  77  based  decision  processor  is  used  to  determine  the 
machining  strategies.  The  decision  processor  in  both  of  these  assistants  utilizes  user 
mputs  of  tools,  stock,  machine,  and  features  and  outputs  suggested  cutting  tools,  speeds 
arid  feeds,  cut  amounts,  suggested  entry  points  and  patterns.  Through  the  development  of 
ese  programs  IMSL  staff  personnel  learned  how  time  consuming  and  difficult  it  was  to 
productionpi°Wedge  wb*ch  when  interrogated  appropriately  provides  “good” 

Artificial  Neural  Networks 

With  the  increasing  level  of  complexity  and  an  increasing  level  of  complexity  of 
manufacturing  environments,  and  a  further  increasing  demand  for  integration 
maintenance  of  the  expert  system  becomes  difficult.  In  today’s  dynamic  manufacturing 
environment,  the  expert  ^stem  shows  some  weakness  in  its  difficulty  with  being  quicldy 
modified  or  upgraded  The  efficiency  and  accuracy  of  the  decision  processor  is  in 

SSL"1™" ,0  program  1,16  dedsi™  —  *  * 

i  i  ?  ^  reco§nizet*  by  the  IMSL  personnel  that  designing  and  developing 
knowledge  based  systems  for  CNC  machining  operations  could  be  formidable  based 
upondie  wide  variety  of  part  designs  and  the  inability  to  anticipate  local  operating 
constraints.  Further  research  was  done  to  investigate  methods  of  cutting  the  lead  times 
and  costs  for  developing  process  planners  by  either  implementing  a  system  which  can  be 
e^ily  fine  tuned  and  adjusted  In  the  pursuit  of  Ihis  goal,  the  IMSL  project  staff  studied  a 

^ for  0,0  m,irag  teed  °n  ihe  «“* 

ANNs  are  parallel  distributed  processing  architectures  composed  of  a  huge 
number  of  small  interacting  elements  that  are  massively  interconnected  Each  of  these 
elements  sends  excitatory  and  inhibitory  signals  to  other  units  that  in  turn  update  their 
behaviors  based  on  these  received  messages.  Basically,  ANNs  are  built  to  imitate  the 
human  bram.  Learning  is  done  by  modifying  the  weights  of  the  connections  between 

hlTr  ,  Seve^advantaSes  were  discovered  by  using  the  neural  net  over  knowledge 
based  systems.  The  firat  was  that  the  neural  net  was  “taught”  by  presenting  sample 

^T1*-  ?etbased  on  ^  and  desired  outputs,  through  a  learning 
,^ffit^n’tdeVe  °PS  Ae  relationships  necessary  to  provide  the  desired  outcome  when 
l^ufficirat  parameters,  i.e.  mputs  without  an  output,  are  introduced  to  the  net  Secondly 
because  training  examples  do  not  need  to  be  exhaustive,  it  would  seem  to  be  an  y 

Neural  ™c 


^  ^^^“-Aided  Process  Plim™8  of  CNC 


improvement  on  the  knowledge  acquisition  bottleneck  often  seen  in  the  development  or 
modification  of  knowledge  bases. 


The  IMSL  process  planning  neural  net  application  was  developed  by  Amjed  Al- 
Ghanim  in  two  phases.  First,  process  planning  was  cast  as  a  pattern  recognition  problem, 
and  viewed  as  a  mapping  function  between  design  features  and  machining  features. 
Second,  a  neural  network  model  was  developed  for  automating  certain  aspects  of  process 
planning  decisions  such  as  the  cutting  tool  material.  Iterative  analysis  of  this  model 
served  two  objectives.  First,  the  neural  net  venture  reduced  the  burden  of  constructing 
general  rules  into  a  simple  data  collection  process.  For  the  sample  models  to  be 
developed,  effective  machining  data  collection  forms  were  designed  and  utilized. 

Second,  the  model  based  on  the  sample  cases  was  proven  capable  of  making  suitable 
machining  decisions.7 

The  neural  net  research  also  highlighted  some  weaknesses.  The  integrity  of  the 
neural  net  is  directly  related  to  the  integrity  of  the  sample  cases.  The  expert  machinist 
sometimes  inconsistently  weighted  identical  variables  from  case  to  case.  By  modifying 
the  input  form  these  inconsistencies  could  be  minimized.  Also,  the  neural  net  could 
provide  inconsistent  responses  from  day  to  day  based  upon  the  user. 

Commercial  Manufacturing  Software  Interfaces 

Both  the  knowledge  base  and  the  neural  net  applications  showed  promise  for  the 
future  but  IMSL  personnel  pursued  the  ability  of  CAPP  systems  to  interface  with  other 
software.  This  work  was  first  begun  by  Gordon  Hunt  when  he  developed  an  interface 
between  the  expert  system  database  already  developed  by  the  commercial  process 
planning  package  MetCAPP  and  the  CAD/CAM  package  ANVIL5K.  The  operation  type, 
work  piece  material,  and  tool  class  were  provided  to  the  MetCAPP  Application 
Programming  Interface  (API)  and  the  depth  of  cut,  feeds  and  speeds  are  then  returned  to 
ANVIL5K  and  input  into  the  appropriate  CAM  modal  settings  which  are  used  in  the 
development  of  tool  paths.  The  interface  was  expanded  on  with  an  integrated 
application  between  MetCAPP  and  the  Turning  Assistant.  The  option  was  added  to  the 
TA  to  access  MetCAPPs  already  extensive  database  of  materials  and  tooling  and  utilize 
that  for  some  basic  machining  decisions.  The  MetCAPP’s  interface  provides  access  to  a 
larger  knowledge  base  consisting  of  an  extensive  material  and  tooling  database,  and  a 
mature  expert  machining  system  but  the  TA  provides  the  expertise  for  interfacing  with 
the  ANVIL5K  CAD  and  CAM  package  and  die  necessary  creation  of  the  tool  paths  that 
are  the  necessary  first  step  toward  NG  code. 

The  utilization  of  the  strengths  of  both  packages  is  a  true  application  of 
integrated  manufacturing  systems  and  hopefully,  a  look  into  the  future  of  C1M  systems. 
The  greatest  problem  facing  computer  integrated  manufacturing  has  been  the  inability  of 


7  A.  Al-Ghanim  and  L.  Cox,  “A  Process  Planning  Technique ...” 
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CAD/CAM  and  CAPP  computer  packages  to  interface  with  each  other.8  The  latest 
application  showed  the  feasibility  of  it  if  the  packages  provide  the  interfaces  The 
creators  of  MetCAPP  and  ANVIL  understood  that  requirement  and  provided  the 
interfaces  for  the  user  to  customize  the  application  and/or  link  applications  together 
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Documentation  of  MetCAPP/TA  Interface 


I.  Objective 

The  objective  was  to  prove  the  feasibility  of  utilizing  an  interface  between  a 
commercial  process  planning  database,  in  this  case  MetCAPP  and  the  Turning  Assistant, 
a  NMSU  Integrated  Manufacturing  Systems  Laboratory  (IMSL)  developed  integrated 
feature  based  CAD/CAM  process  planning  package.  The  Turning  Assistant  previously 
used  a  developed  knowledge  base  to  determine  the  appropriate  speeds  and  feeds  based 
upon  the  type  of  machining  operation,  the  part’s  material  type  and  the  tool  class.  Yet,  the 
knowledge  base  was  limited  to  the  developer’s  expert  knowledge.  The  MetCAPP 
database  in  comparison  is  a  vast  resource,  already  developed  and  externally  maintained. 

II.  Background 

The  Turning  Assistant  (TA)  is  an  NMSU  Integrated  Manufacturing  Systems 
Laboratory  (IMSL)  developed  project  based  upon  the  another  IMSL  project,  the  Milling 
Assistant  (MA).  The  TA  did  for  turning  operations  what  the  MA  performed  for  milling 
operations,  it  took  a  completed  CAD  drawing  and  based  upon  user  inputs  used  a  case 
based  reasoning  system  to  automate  the  decision  process  necessary  to  develop  NC  tool 
paths.  Both  of  these  packages  used  a  GRAPL IV  application  interface  to  access  the 
knowledge  base.  Utilizing  the  user’s  inputs,  the  appropriate  machining  operation 
decisions  are  then  passed  back  to  the  CAD/CAM  package  ANVIL5K,  where  system 
parameters  are  then  set  and  tool  paths  based  on  those  parameters  developed.  These  tool 
paths  are  output  as  .clp  files,  and  through  a  post  processor  utilized  to  develop  the  NG 
code  that  directs  the  machining  of  the  part. 

MetCAPP  is  an  established  commercial  process  planning  package.  It  is  made  up 

of  CUTPLAN,  a  process  planning  module,  CUIDATA,  the  machining  parameter 

information  database,  and  CUTTECH,  the  MetCAPP’ s  expert  machining  system.  All  of 

these  integrated  pieces  provide  the  planner  with  data  on  the  time  and  cost  impacts  of 
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various  machining  decisions.  For  example,  through  CUTDATA,  recommendations  are 
provided  on  cutter  diameter,  speed,  feed,  tool  material,  machine  rpm  and  ipm,  and  power 
based  on  user  inputs  of  operation,  material  and  tool  class.  These  can  duly  be  input  into  a 
CUTPLAN  matrix  where  these  inputs  are  translated  into  cutting  time,  labor  hours, 
material  used,  and  machine  hours.  What  MetCAPP  does  not  do  is  provide  these  inputs 
from  a  feature  based  model. 

m.  Scope 

A  menu  was  added  to  the  TA  where  the  user  was  queried  as  to  whether  they 
wanted  to  utilize  MetCAPP.  The  user  was  then  prompted  for  the  required  inputs  of 
material  type,  machining  operation,  and  tool  type.  These  inputs  were  passed  to  TA 
building  on  an  interface  developed  by  Gordon  Hunt.  Hunt  initially  developed  an 
interface  between  ANVIL5K  and  METCAPP  that  extracted  base  speed  and  feed  data 
based  upon  the  initial  inputs.  These  outputs  were  in  a  format  which  with  a  conversion 
could  be  used  as  the  necessary  machine  speeds  and  feeds  required  by  Anvil’s  CAM 
package.  If  the  user  utilized  the  MetCAPP  option  for  obtaining  speed  and  feed  data,  the 
existing  knowledge  base  resident  in  the  TA  was  not  accessed  for  that  specific 
information. 

IV.  Methods 

The  NC  Tum  was  modified  to  incorporate  a  CALL  statement  to  machdatagrs. 
Machdata.grs  was  the  top  tier  program  that  then  called: 

•  subroutine  ‘submat’  for  user  definition  of  the  part  material. 

•  subroutine  ‘subtool.grs’  for  user  definition  of  the  type  of  tooling. 

•  subroutine  ‘  subop.  grs’  for  user  definition  of  the  type  of  operation.  These  were 
limited  to  turning  operations  since  the  application  was  being  utilized  in  the  TA. 

The  operation,  tool  type  and  operation  are  then  passed  through  to  ‘getcapp’  via  a 

XCALL  function.  ‘Getcapp’  is  a  Standard  C  program  that  opens  communication 

between  the  MetCAPP  CUTDATA  database  and  the  function  ‘cappdatax.'  ‘Cappdatac’ 

controls  the  functions  that  utilize  the  MetCAPP  Application  Programming  Interface  to 
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create  a  new  session  of  CUTDATA  and  pulls  the  required  data  out  and  stores  it  in  various 
buffers  as  matches  for  key  files  are  found  Cappstor.txt  is  then  opened  to  write  the 
operational  inputs  and  the  outputs  to  a  file.  If  no  match  is  found  then  an  error  flag  is 
returned.  The  program  ends  and  control  is  again  returned  to  ‘getcapp.’ 

A  function  called  ‘dataout’  is  initiated  which  opens  cappstor.txt  and  transforms 
the  character  data  to  either  string,  integer,  or  double  precision  is  performed  on  every 
variable  read  from  the  file.  The  GRAPL-IV  program  machdata.grs  then  receives  the 
data  back.  The  modals  are  set  with  the  new  parameters  for  speeds  and  feeds.  Within 
machdata.grs  a  flag  is  set  to  indicate  that  the  selection  of  material  and  tools  has  already 
occurred.  Control  of  the  program  is  given  back  to  NC  Tum  and  the  tool  paths  are 
selected  based  upon  the  MetCAPP  inputs. 

Turning  Assistant  Changes 

The  changes  to  the  Turning  Assistant  included: 

•  A  query  and  menus  for  selection  of  turning  operation,  part  material,  and  type  of  tool. 

•  Flag  check  to  see  if  the  above  selections  were  done  prior  to  entering  the  TA  decision 
processor. 

•  Added  a  conversion  factor  to  correct  the  MetCAPP  inputs  to  ipm  or  rpm  instead  of 
ifin  as  his  program  input  them. 

•  Added  a  subroutine  to  verify  the  machine  tool  limitations  with  the  MetCAPP  inputs.  The 
MetCAPP  expert  system  only  takes  into  account  the  type  of  process  and  the  part  material 
in  its  calculations  of  the  various  speeds  and  feeds.  This  caused  problem  when  the 
machine  tool’s  capabilities  were  Ihe  limiting  factor  to  the  speeds  and  feeds.  The  TA  was 
modified  to  make  a  comparison  between  the  MetCAPP  inputs  and  the  TA’s  tool  database 
so  that  the  modal  change  only  incorporated  the  limiting  speed  or  feed. 
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V.  Conclusion 

a  fa,  It  T"  °f  ^ iMeACe  betWeen  ^  CAD/CAM  P**^’ a  CAPP  «*m  and 
knowledge  based  processor  is  a  due  example  of  the  potential  for  integmted 

m^acmnng  software.  Only  recently  have  manufacture  software  packages  even 

“  “T t0  Pr0V,de  m,erfaCeS  °r  ^  CUS“0”  *»  <°  extra  money 
m  performing  the  customization  or  forcing  users  to  buy  huge  expensive  packages 

Therefore,  MeCAPP  and  Anvil  5K  are  unique  by  provrding  application  interfaces  that 

e  user  to  expand  their  capabilities  and  customize  their  use  of  the  software 

ization  of  the  strengths  of  both  packages  is  a  true  application  of  integrated 
manufacturing  systems  and  hopefully,  a  look  into  the  future  of  Of  systems  The 

prob'emfacing  computer  integrated  manufacturing  has  been  the  inability  of 
an  CAPP  computer  packages  to  interface  with  each  other.'  This  application 
Showed  die  feasibility  of  i,  if  the  packages  provide  die  interfaces. 


'  J.  de  Vries,  O.W.  Salomons,  A.H. 
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ABSTRACT 

This  paper  presents  analysis  and  results  of  implementing  two  machining  knowledge 
acquisition  methodologies.  The  study  is  part  of  the  machining  strategy,  machining  assistant  and 
process  planning  research  efforts  being  conducted  at  NMSU-Integrated  Manufacturing  Systems 
Laboratory  (IMSL).  Goals  of  the  project  aim  at  improving  and  advancing  the  solicitation, 
documentation,  and  automation  of  machining  knowledge.  To  achieve  these  goals,  this  research 
project  has  three  major  interrelated  phases  through  which  the  intended  work  has  been  conducted. 
These  phases  are  machining  knowledge/data  acquisition,  automation,  and  integration  with 
CAD/CAM/CAE  systems.  This  paper  emphasizes  the  knowledge  acquisition  phase  of  the  study. 

Designing  and  developing  knowledge  based  systems  for  Computer  Numerical  Control 
(CNQ  machining  operations  is  a  very  formidable  task  due  to  the  wide  variety  of  part  designs 
and  the  inability  to  anticipate  local  operating  constraints.  What  works  in  one  place  does  not 
necessarily  work  in  another.  The  steep  learning  curves  and  high  costs  for  implementing  and 
using  CAD/CAM  has  resulted  in  a  very  slow  shift  toward  computerized  machining.  In  order 
to  cut  lead  times  and  reduce  costs  for  operating  CNC  equipment,  systems  are  needed  which  can 
be  easily  tuned  and  adjusted  during  implementation  and  then  continuously  improved. 

KEY  WORDS:  (CAD/CAM,  CAPP,  CNC,  Machining  Strategy,  Knowledge  Base) 

INTRODUCTION 

Planning  and  developing  Numerical  Control  (CNQ  programs  for  the  production  of 
mechanical  parts  is  an  expensive  and  time-consuming  process.  Part  geometry,  design 
specifications,  local  expertise  and  operating  constraints  form  the  basis  for  the  selection 
machining  strategies  which  must  be  translated  (by  one  of  various  methods)  into  specific  machine 
controller  instructions.  A  process  engineer  (domain  expert),  who  traditionally  performs  these 
functions  manually,  routinely  applies  learned  skills  to  prescribe  manufacturing  operational  steps 
and  specific  machining  parameters. 

In  one  investigation  the  project  staff  developed  a  prototype  neural  network  model  that 
could  handle  few  Process  Planning  (PP)  functiortg.  Iterative  analysis  of  this  model  served  two 
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objectives.  First,  the  development  process  of  this  pilot  system  provided  a  new  unovanve 
machining  knowledge  acquisition  methodology.  Hus  neural-based  ventrne  reduced  the  burden 
of  constructing  general  rules  into  a  simple  data  collection  proofs.  Our  effort  resulted  in 
designing  and  utilizing  a  set  of  effective  instrumental  machining  data  collection  fonns.  And, 
second,  the  model  provided  a  promising  neural-based  PP  system  capable  of  automating  suitable 

planning  functions. 


One  main  focus  is  to  document,  analyze,  and  classify  various  machining  mput  paiaireters 
and  techniques  accumulated  through  literature  review  and  interviews  with  experts.  TheflXEF3 
(ICAM  Definition  Language  3)  structured  model  of  the  thought  process  followed  by  a  -CNC 
programmer  helps  maintain  a  consistency  throughout  the  machining  assistants  because  it  can  be 
used  to  acquire  and  represent  knowledge,  format  the  decision  processor,  and  assist  the  user  in 
reviewing  the  reasons  for  suggested  methods  after  a  tool  path  is  built. 


Commercial  CAD/CAM/CAPP  software  do  not  currently  facilitate  the  automatic 
accumulation  of  design  and  manufacturing  data  produced  throughout  the  planning  and  production 
of  parts.  Today’s  CNC  machine  tool  programming  methods  are  inefficient  and  do  not  utilize 
the  product  model  concept.  Plans  are  often  produced  and  executed  on  the  fly,  modified  on  only 
the  final  hard  copy,  and  inaccessible  by  people  responsible  for  job  costing.  Furthermore, ,  the 
-intelligence  and  know-how  to  manufacture  and  track  parts  resides  in  wide  base  of  skmed 
craftsmen  (and  women),  process  engineers  and  manufacturing  experts.  Advanced  NC 
programming  systems  will  be  necessary  to  exercise  the  use  of  a  product  model. 

Machine  tool  manufacturers  have  benefitted  greatly  over  the  years  from  customer 
feedback.  Shared  techniques,  machinability  tests,  and  performance  evaluations  have  educated 
and  diversified  them  in  the  areas  of  computer  hardware  and  CNC.  Many  machmes  have 
powerful  built  in  programming  systems  that  compete  closely  with  the  option  of  CAD/CAM.  The 
computer  hardware,  software,  and  communication  links  built  in  as  part  of  the  machine  tend  to 
be  primitive.  The  problem  is  that  these  machines  and  controller  interfaces  end  up  being  the  place 
where  all  process  planning  occurs.  One  term  used  to  describe  it  is  "programming  at  the 
spindle."  Operations  can  become  dependent  on  a  certain  machine/operator  and  programs  or 
parts  cannot  be  re-routed  in  case  of  breakdowns  or  scheduling  problems.  There  is  no 
intermediate  "generic"  cutter  location  data  file  which  can  be  postprocessed  for  an  alternative 
"machine  with  similar  capabilities.  Furthermore,  there  is  no  level  of  control  past  the  machine 
interface.  Design,  planning,  manufacturing,  and  inspection  data  is  difficult  to  link  when  so 
much  of  it  resides  in  the  relatively  inaccessible  machine  controller  unit.  Operators  traditionally 
are  very  good  verbal  communicators  but  have  limited  writing  and  documentation  training. 
Commercial  CAD/CAM  gets  limited  acceptance  among  operators/NC  programmers  who 
perceive  that  their  jobs  are  being  replaced  by  computers. 

Process  Planning.  Process  Planning  (PP)  is  defined  by  the  society  of  manufacturing 
engineers  as  ’the  systematic  determination  of  the  methods  by  which  a  product  is  to  be 
manufactured  economically  and  competitively*.  The  task  of  process  planning  involves  a  senes 
of  steps,  the  first  of  which  is  the  interpretation  of  the  product  design  data  which  consists  of 
geometric  configuration,  dimensions  and  tolerances,  and  raw  material  properties.  ^Second, 
machine  tools  capable  of  performing  the  required^rocesses  will  have  to  be  selected  based  on 
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machine  tools  availability,  capability,  range  of  machining  operations,  required  production  rates, 
etc..  The  operation  sequence  is  then  determined  based  on  groups  of  parts  to  be  produced.  The 
tool  sets,  jigs  and  fixtures,  and  cutting  conditions  such  as  the  depth  of  cut,  feed  rate,  speed  of 
cutting  tool  have  to  be  determined.  Finally  process  sheets  (route  sheets)  and  part  programs  are 
developed,  taking  into  consideration  the  aforementioned  factors  [2].  In  summary,  process 
planning  for  machining  can  be  itemized  as  a  sequence  of: 

•  Interpretation  of  product  design  data. 

•  Selection  of  machining  processes,  machines  tools,  and  cutters. 

•  Determination  and  design  of  jigs  and  fixtures. 

•  Sequencing  of  the  selected  operations. 

•  Determination  of  proper  cutting  conditions. 

•  Generation  of  process  sheets  and  numerical  control  (NC)  part  programs  (in  the  case 
of  automated  manufacturing). 

Thus,  a  process  planner  is  a  manufacturing  subsystem  responsible  for  converting  design 
data  into  work  instructions.  It  represents  the  link  between  engineering  design  and  shop  floor 
manufacturing,  and  determines  the  manufacturing  operations  required  to  transform  a  part  from 
a  rough  state  to  a  finished  state  specified  by  the  engineering  drawing.  In  discrete-part  metal¬ 
cutting  operations  (e.g.  milling,  turning,  grinding,  drilling.. etc.),  a  number  of  computational 
procedures  (e.g.  mathematical  models,  heuristics)  is  required  to  determine  the  above  itemized 
macro-level  tasks  and  associated  micro-level  decision  variables.  For  example,  based  on  the  part 
identifiable  manufacturing  features  (or  machining  surfaces),  required  finish  and  tolerances, 
production  rate,  and  available  machine  capabilities,  an  appropriate  (perhaps  optimal)  production 
process(s)  can  be  selected  to  machine  that  feature.  Once  a  process  is  selected  then  utilizing  such 
variables  as  part  material  characteristics  (e.g.  hardness,  machinability),  part  dimensions,  cost 
of  raw  material,  cutting  cost,  a  set  of  cutting  conditions  like  cutting  speed,  feed  rate,  depth  of 
cut,  number  of  passes,  and  usage  of  coolants  can  be  determined. 

Process  Planning  Automation.  Computer  Aided  Process  and  operations  Planning 
(CAPP)  systems  attempt  to  automate  the  decision  making  process  necessary  to  map  design 
specifications  on  the  company  specific  manufacturing  environment.  Many  CAPP  systems  utilize 
some  form  of  Artificial  Intelligence  (AI)  to  model  the  human  interaction  previously  required  by 
the  task.  CAD/CAM  software  and  databases  are  used  to  supply  product  descriptions  and 
information  management,  respectively.  Material  removal  operations  provide  a  rich  environment 
for  the  implementation  of  intelligent  applications.  Much  of  the  current  work  in  this  field 
concentrates  on  the  AI  technique  used  to  process  the  knowledge  and/or  the  solid  model  design 
features  utilized  by  the  engineer  to  initially  create  the  part  [1, 5,7,8]. 

Process  planning  is  a  difficult  task  to  automate  because  it  comprises  of  a  series  of 
functions  that  do  not  submit  ’nicely’  to  modeling  techniques.  Nevertheless,  there  are  two  well- 
known  systematic  computer-aided  process  planning  (CAPP)  strategies,  namely,  variant  planning 
and  generative  planning.  The  distinguishing  feature  of  the  variant  planning  strategy  is  that 
former  plans  are  retrieved  and  modified  for  new  parts,  while  generative  planning  is  a  strategy 
that  strives  to  create  new  part  plans  from  scratch  based  on  analyzing  part  geometry  and  other 
related  specifications  [4].  A  hybrid  of  the  variant  and  generative  is  common. 

1-44 
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Developing  knowledge  acquisition  and  representation  tools  which  allow  information 
pertaining  to  specific  features  to  be  documented  and  evaluated  is  viewed  as  an  important  part 
of  a  production-worthy  CAPP  system.  Generative  AI  programming  languages  all  require  the 
same  knowledge  as  the  basis  for  structuring  a  decision  processor,  regardless  of  the  technique. 
Many  CAPP  systems  require  AI  programmers  to  modify  and  enhance  the  rule  bases.  Breaking 
the  decision  process  down  into  small  steps  allows  for  a  procedural  approach  to  the  actual 
decision  processing.  Knowledge  attained  during  meetings  with  experts  along  with  the  resulting 
rules  formulated  for  that  particular  decision  can  be  stored  in  the  model. 

Process  Flow  Model.  In  1981,  the  U.S.  Air  Force  Integrated  Computer-Aided 
Manufacturing  (IC  AM)  program  developed  structured  methods  for  applying  computer  technology 
to  manufacturing.  These  methods  were  designed  to  better  understand  how  to  best  improve 
manufacturing  productivity.  There  are  three  well-documented  modeling  methodologies  defined 
and  developed  around  the  ICAM  Definition  (IDEF)  approach  to  system  study;  IDEFO,  IDEF1, 
and  IDEF2. 

The  Air  Force  contractor  responsible  for  the  continued  development  and  maintenance  of 
the  public  domain  IDEF  system/software  definition,  design,  and  engineering  methods  is 
Knowledge  Based  Systems  (KBS)  Inc.  (College  Station  TX).  KBSI  has  expanded  the  original 
IDEF  languages  to  include  IDEF3-IDEF6. 

The  IDEF  methodology  can  be  applied  as  a  descriptive  tool  or  an  analysis  tool.  The 
descriptive  tool  uses  heuristic  models  and  is  applied  to  complex  problems  such  as  manufacturing 
processes.  The  analysis  tool  requires  a  more  in-depth  evaluation  and  is  applied  to  more 
narrowly  defined  problems  such  as  business  operations  or  practices.  The  idea  is  to  establish  a 
common  language  to  enhance  communications  and  evaluation  and  to  stimulate  organizational 
learning.  It  uses  a  graphical  approach  to  describe  the  manufacturing  functions  of  a  system.  A 
standard  notation  has  been  developed  that  includes  boxes  or  blocks  to  represent  activities  or 
functions  and  various  connecting  arrows  that  have  specific  representational  purposes. 

The  IDEF3  model  is  similar  to  a  decision  tree  with  information  boxes  associated  to  each 
decision  point.  The  goal  is  to  concentrate  on  the  structuring  of  the  specific  thought  processes 
followed  by  NC  programmers.  In  this  way,  the  information  necessary  to  build  a  decision 
processor  can  be  stored  independently  of  the  AI  technique  utilized.  This  research  utilized 
graphical  models  and  a  database  for  holding  machining  strategies  acquired  from  literature  and 
interviews. 

Product  Models.  The  complete  electronic  product  definition  for  a  manufactured  part 
consists  of  all  the  information  related  to  designing,  producing,  inspecting,  and  managing  the 
manufacture  of  fhat  part.  Efficient  utilization  of  these  articles  of  data  would  facilitate  the  rapid 
production  of  the  item  using  computerized  machine  tools  and  processes.  National  efforts  are 
underway  to  define  standards  for  complete  product  models.  A  product  model  is  defined  as  a 
computer  representation  of  a  part  or  assembly  containing  the  combined  information  associated 
with  CAD,  CAM,  CAE,  CAI,  process  planning  and  job  tracking.  It  is  the  basis  for  Computer 
Integrated  Manufacturing  (CIM),  Concurrent  Engineering  (CE),  and  Design  For 
Manufacturability  (DFM)  studies.  Figure  1  idfnt^f es  the  major  areas  and  critical  components 


295 


^CAPOS 


Pt  w*ft*t*ft* 

Di|'>t|Wf  ’  §| 

/.T;W: ’*>•.' '::•: 

T"i  n  I  eho*  :, 

;!KiU:t*r  ia  r  V': 

0  m  0  n  t .  1 1  y 
•  Fio  P;r  *  jo  i  y  p  • 

Vf  suat  t z«t  tor* 


Jo|> 

M&or. |p|i»ijiiitii:l .. 

M<tr*r*a$  Uqt*\  ||R| 

M  iMHBl 

•  SKIppln<*/Ro**i  V  l;htf 

Pro^Mr*******: 

Markotfn? 


li 


ii 


mii 

::illl®p;- 


IE 


ii 


. IGBI 

IdataI 


m 


Ii 


C53 

MANAGEMENT 


TABUE  Or  CCWimTsT^^ 


:°  FlttlU 

H  * 

*  .:•  *fNt4  •  .-: 

:  :#■  “i.p^O  t  ir  S  I  ••• 

<*  Md««  Fr«|i*r!U0 

D«t^»i.F#r:  N<mu  f'aeiwr* 

.<*  ♦  fit 


CONCURRENT | 
ENOiNEERING 


STANDARDS 

>  PDES 

>  STEP 

S  r:miis  1 

>  ICES 
I  ANSI 


?;Sre:Pr«.dgr.* 
«■  CMU  Data 

4*  lrn|>*cM’On 

II 

divarication  1 


mechanical: 

PARTS  : 


f+aluroe  ,: 

Mach  1  n*£  •' 

;^5 1. y Upo. 

NC 

;•  *  Slrat#«)*s 

’  PuriaMAUrc! 

;;":  •'  ♦  Voir  I  :'f  l««i  I  o  tv’;: 


TABLE  OF  OOHTOirS*  C.  :\  \ 

Copacl fy  Planning 
Control  .  '; 

SchodoHn* 

::;i;n^Pr.^o;i>^:.  Iriepo^t ion 
F+odbaok  •  MochanI  CMC  •• 
P*j jr*f ormdrtc*  Rating 
•  Op d  ra  t  or  '*  trio  f  r  mc  1 1  on  •  • 
Con  1 1  ng oncy  Plane :  ..  ...: 


Figure  1:  Components  of  the  Complete  IJro^ict  Model 


296 


within  each  area  necessary  to  build  a  complete  product  model  for  a  mechanical  part.  The  file 
folders  represent  product  data  that  should  be  stored  in  a  common  area  so  that  it  can  be  shared 
throughout  the  operation.  \v 

The  data  provided  by  the  product  model  must  be  accessible,  well  organized,  and  conform 
to  standards  prevalent  in  both  industry  and  government.  In  many  ways,  the  product  model  is 
unobtainable  because  it  relies  on  the  unrestricted  sharing  of  data  throughout  the  design, 
engineering,  and  manufacturing  processes.  Many  aspects  of  converting  raw  material  into  a 
finished  parts  are  difficult  to  track  on  the  shop  floor.  The  Computer  Aided  Acquisition  and 
Logistics  Support  (CALS)  and  Product  Definition  Exchange  Specification  (PDES)  /  Standard  for 
the  Exchange  of  Product  Data  (STEP)  initiatives  are  currently  addressing  these  very  issues. 

MACHINING  STRATEGY 

"The  first  principle  of  knowledge  engineering  is  that  the  problem  solving  power  exhibited 
by  an  intelligent  agent’s  performance  is  primarily  the  consequence  of  its  knowledge  base,  and 
only  secondarily  a  consequence  of  the  inference  method  employed.  Expert  systems  must  be 
knowledge-rich  even  if  they  are  methods-poor.  This  is  an  important  result  and  one  that  has  only 
recently  become  well  understood  in  AI.  For  a  long  time  AI  has  focused  its  attentions  almost 
exclusively  on  the  development  of  clever  inference  methods;  almost  any  inference  method  will 
do.  The  power  resides  in  the  knowledge." 

-Edward  Feigenbaum,  Stanford  University 

Available  tooling,  machine  accuracy,  and  setup  rigidity  can  influence  one’s  decision 
drastically.  Many  times,  duplicate  tools  are  used  to  both  rough  and  finish  a  groove  (especially 
when  considering  a  large  number  of  parts)  so  that  results  can  be  repeated  consistently.  Setup 
decisions  are  based  upon  the  configuration  of  the  part,  the  material  and  the  tooling- available. 

Setup  choices  include  tool  selection,  depth/width/cut,  speed  and  feed. 

Features.  The  concept  of  "features"  plays  an  important  role  in  linking  design  to 
manufacturing.  CAM-I’s  Current  Status  of  Features  Technology  defines  features  used  in  process 
planning  for  manufacturing  as  "shapes  and  technological  attributes  associated  with  manufacturing 
operations  and  tools  [3]."  From  a  machinist’s  viewpoint,  the  first  step  in  developing  a 
machining  plan  is  to  break  the  part  into  features.  Each  feature  is  characterized  as  a  structural 
entity  whose  attributes  specify  lower-level  geometry,  topology,  tolerance,  surface  finishes,  etc. 


Tool  Selection.  Roughing  and  Finishing  tools  are  capable  of  cutting  in  a  wide  variety  of 
materials  and  orientations.  For  clearance  purposes,  the  angles  of  the  tool  and  it’s  reference  to 
the  work  and  holder  are  critical.  Although  many  tools  are  custom  ground,  standardized  tooling 
inserts  are  becoming  common  and  general  tool  definitions  can  be  identified  for  classes  of 
materials  and  conditions.  Clearance  for  chip  removal,  part  avoidance,  and  holders  is  one  of  the 
first  considerations.  Finish  cuts  for  various  materials  are  usually  between  0.010"  and  .075" 
depending  on  the  material  and  desired  finishes  and  tolerances.  "Spring"  or  "free"  passes  are 
also  common.  No  new  material  is  theoretically  cut  but,  due  to  part  and  tool  deflection,  this  pass 
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usually  cuts  0.0005 "-.001"  and  leaves  a  smooth  finish. 

Speeds  and  Feedrates.  The  subject  of  speeds  and  feeds  could  never  be  generalized  for 
all  applications  due  to  the  wide  variety  of  materials,  machines,  and  tools  available.  The  usual 
sequence  is  to  start  with  a  cutting  speed  (surface  feet  per  minute)  for  the  particular  work 
material,  calculate  an  RPM  using  the  speed  and  diameter  of  the  part,  then  use  the  RPM  and  feed 
per  tooth  for  the  tool/material  to  calculate  a  feedrate  and  are  usually  assigned  for  approach, 
intermediate,  and  exit  motions.  As  a  safety  precaution,  the  feedrate  can  be  used  to  calculate  the 
material  removal  rate  and  the  required  horsepower.  Speeds  and  feeds  are  commonly  adjusted 
once  the  operation  begins.  Sounds,  torque  readings,  and  vibration  are  key  signals  which  can  be 
used  to  tune  cutting  parameters  during  machining. 

Tool  Motion  Points.  In  any  CAM  system,  many  tool  location  points  must  be  defined. 
The  first  considerations  are  for  safety  and  non-violation  of  the  finished  part  boundaries.  Home 
or  from  points  are  defined  either  uniquely  for  a  machine  or  arbitrarily  by  the  operator  so  as  to 
reference  the  tool  path  correctly  before  beginning  the  operation.  It  is  commonly  some  fixed 
distance  from  the  part  origin  (usually  at  the  end  of  the  part  on  the  axis  of  rotation).  Approach 
points  are  used  as  an  place  where  the  tool  can  rapid  feed  to  before  entering  the  material.  The 
tool  enters  the  material  to  the  first  depth  of  cut  and  stops  at  the  entry  point.  Motions  are 
executed  to  complete  one  pass  before  the  tool  moves  to  the  exit  point.  The  exit  point  to  the 
withdrawal  point  can  either  be  an  intermediate  move  before  moving  back  to  re-enter  the  part  or 
a  place  from  which  the  tool  can  rapid  feed  back  to  the  home  location.  In  addition  there  are 
assignable  intermediate  approach  and  re-entry  points  as  are  required  for  profiles  with  multiple 
passes.  Much  of  tool  wear  occurs  when  the  tool  violates  un-represented  stock  on  entry  and  exit 
motions.  Strategic  selection  of  these  points,  along  with  the  feedrates  assigned  at  each  stage,  can 
optimize  and  ensure  the  safety  of  tool  paths. 

Tool  Selection.  Often  times  the  same  tool  is  used  for  both  roughing  and  finishing 
grooves.  In  production  situations,  the  concept  of  duplicate  tooling  is  employed  to  keep  tooling 
costs  down  and  achieve  repeatability.  The  tool  selection  process  can  be  separated  into  a  series 
of  choices  between  materials  and  geometric  attributes  associated  with  grooving  tools.  The 
choices  include  a  tool  material,  tool  width,  tool  shape,  tool  length,  and  comer  radius. 

Cut  Width  and  Depth.  The  width  of  cut  is  directly  based  on  the  material’s  machinability 
and  the  tool  selected.  The  first,  and  very  critical  cut  is  at  full  tool  width.  A  very  tough  material 
may  be  cut  at  somewhere  between  1/4  and  1/2  of  it’s  width  on  passes  following  the  plunge  cut. 
Grooves  are  cut,  most  often,  to  full  depth  on  every  pass.  It  is  different  than  say,  milling, 
because  tool  pressures  are  close  to  the  same  at  full  depth  as  when  first  entering  the  material. 
If  a  multi-depth  groove  is  encountered,  it  is  often  treated  as  two  operations,  with  a  smaller  tool 
repeating  the  same  basic  sequence  as  the  larger  tool. 

Approach  and  Entry.  The  tool  should  start  at  the  home  position  (may  be  fixed  or  reset 
at  some  fixed  distance  from  the  origin  of  the  part)  and  move  at  rapid  feed  over  to  .020"  away 
from  the  feature.  The  wall  closest  to  either  the  chuck  or  the  center  should  be  used  at  the  start 
location  in  turning.  It  is  safe  to  move  the  tool  at  a  rapid  feedrate  over  to  within  .  100"  outside 
the  stock  outline  or  above  the  part.  The  firstim<«ion  can  be  a  direct  plunge,  at  full  tool  width. 
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to  a  depth  prescribed  or  a  ramp.  A  dwell  should  be  programmed  at  about  One  second  to  allow 
for  chip  removal. 

KNOWLEDGE  ACQUISITION  AND  REPRESENTATION 

A  machinist  in  charge  of  designing  a  tailored  process  plan  is  always  faced  with  the 
problem  of  selecting  from  a  vast  variety  of  possible  combination  values  of  machining 
parameters,  such  as  speed,  feed,  depth  of  cut,  tool  material  and  size,  tool  entry  strategy,  cutting 
strategy,... etc.  A  machinist  makes  decisions  based  on  past  machining  cases  that  he  recalls  from 
his  experience.  Very  often,  to  help  overcome  some  difficulties  of  this  unstructured  decision¬ 
making  process,  machinists  employ  some  form  of  heuristic  rules  to  distinguish  between  different 
cases;  however,  each  machining  case  can  be  unique  in  some  aspects  and  may  require  task- 
specific  solution  strategies.  "It  is  very  difficult  to  construct  general,  and  yet  simple,  rules 
capable  of  handling  all  or  most  machining  cases,"  said  Joe  Alejandrez,  a  20-year-experience 
machinist  form  the  Physical  Science  Laboratory  (PSL).  That  is  because  process  planning  is  a 
function  of  multiple  inputs  and  multiple  outputs  describing  the  part  to  be  machined,  the 
manufacturing  resources,  and  the  manufacturing  procedures.  Each  output  decision  is  driven  by 
a  set  of  raw  interacting  inputs  and  possibly  by  other  outputs,  resulting  in  a  complex 
parallel/sequential  decision  process.  The  process  is  subject  to  machinists’  preference  as  there 
is  a  wide  range  of  machining  parameters  that  can  satisfy  the  requirements  of  the  product  model. 
This  wide  range  is  the  basis  of  much  discussion  regarding  the  ’best’  approach  to  machining  a 
quality  part.  A  trained  machinist  can  effectively  recognize  and  evaluate  all  pertaining  factors 
and  work  out  a  ’good’  process  plan. 

The  most  important  and  difficult  aspect  of  this  research  is  to  develop  effective  tools  with 
which  to  elicit  knowledge  from  domain  experts.  The  skills  obtained  through  years  of 
observation  training,  improvising,  and  triai-ahd-error  are  not  easy  to  explain  verbally,  document, 
or  formalize  for  the  purpose  of  building  automated  NC  programming  systems.  A  significant 
amount  of  time  must  be  spent  in  familiarizing  oneself  with  the  field  and  terminology  before 
attempting  to  solicit  strategies  directly  from  the  NC  programmers  and  machinists.  Even  though 
technology  has  matured  very  quickly  in  the  areas  of  CNC  machining  and  CAD/CAM, 
experience  with  manual  machining,  "old  tricks,"  and  traditional  material  removal  skills  are  still 
necessary.  This  practical,  applied  way  of  thinking  cannot  be  documented  in  machining 
handbooks  or  obtained  from  questionnaires  and  interviews.  An  aged  work-force,  loss  of 
manufacturing  jobs,  and  the  disappearance  of  apprenticeship  and  mentor  programs  combine  to 
put  the  U.S.  in  danger  of  losing  a  once  dominant  technology. 

Machine  operators  and  programmers  must  work  as  a  team  in  order  for  high  standards  of 
quality  to  be  achieved.  Many  times,  process  and  design  engineers  have  not  been  included  in  this 
team.  NC  programming  functions  are  rarely  evaluated  from  an  engineering  standpoint.  Many 
times  either  technicians  and  craftspeople  or  management  decide  on  the  tools  that  will  be  made 
available  for  process  planning.  To  keep  production  going,  the  system  a  company  is  familiar 
With  becomes  a  safety  net.  New  options  are  rarely  given  a  chance  to  succeed  due  to  the  steep 
learning  curve  and  high  costs  associated  with  CAD/CAM,  NC  programming,  and  post¬ 
processing. 
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It  is  difficult  to  obtain  the  assistance  of  skilled  machinists  when  attempting  to  build  an 
automated  programming  system.  Information  is  difficult  to  communicate  verbally  and  formalize 
for  the  purpose  of  designing  a  knowledge  base.  A  key  factor  in  the  effort  lies  in  the  re-training 
of  shop-floor  workers.  If  some  decisions  they  normally  make  are  being  replaced  by  computer 
programs,  then  new  knowledge  must  be  given  in  return.  The  services  of  an  experienced  lathe 
operator  (Ken  Davis,  17  years  between  FreightLiner  and  BOEING)  for  two  months,  greatly 
contributed  to  the  success  of  the  this  project. 

This  research  helped  to  develop  knowledge  elicitation  and  representation  tools  that  can 
be  used  to  formalize  the  expertise  gained  through  machining  strategy  meetings,  literature  review, 
and  programmer  feedback.  It  includes  the  design  of  questionnaires  that  can  be  used  to  document 
strategies  in  a  set  format  during  interviews  and  observation  training. 

Developing  knowledge  acquisition  and  representation  tools  which  allow  information 
pertaining  to  specific  turned  features  to  be  documented  and  evaluated  is  viewed  as  an  important 
part  of  a  production-worthy  CAPP  system.  Generative  AI  programming  languages  all  require 
the  same  knowledge  as  the  basis  for  structuring  a  decision  processor,  regardless  of  the  technique. 
Many  CAPP  systems  require  AI  programmers  to  modify  and  enhance  the  rule  bases.  Breaking 
the  decision  process  down  into  small  steps  allows  for  a  procedural  approach  to  the  actual 
decision  processing.  Knowledge  attained  during  meetings  with  experts  along  with  the  resulting 
rules  formulated  for  that  particular  decision  can  be  stored  in  the  model.  (See  Process  Flow 
Model) 

Automatic  creation  of  NC  programs  requires;  1)  user-interaction  --  to  gather  location 
specific  operating  information,  2)  interrogation  of  the  features  geometric  description  within  the 
CAD  database,  3)  a  file  based  communication  link  to  a  decision  processor  and,  4)  An 
applications  interface  language  capable  of  building  tool  paths  from  user  generated  programs. 
Very  few  commercial  CAD/CAM  systems  provide  the  user  with  enough  power  to  accomplish 
these  tasks.  Many  CAPP  systems  have  had  to  build  their  own  primitive  CAD  system  for  input 
or  used  symbolic  languages  to  output  the  parts  geometric  description.  Obvious  extensions  to  the 
automated  process  planner  described  here  include  automatic  feature  recognition,  enhanced 
decision  processing  methods,  part  program  optimization  routines,  and  Group  Technology  (GT) 
coding  schemes  for  individual  rotational  features.  For  this  project  we  utilized  ANVIL5000 
CADD/CAM/CAE  software  for  its  application  interface  and  integrated  capabilities. 

ARTIFICIAL  NEURAL  NETWORKS 

Interest  in  ’brain  style  computing’  in  terms  of  artificial  neural  networks  (ANN)  is 
increasing.  ANNs  are  parallel  distributed  processing  architectures  composed  of  a  large  number 
of  small  interacting  elements  that  are  massively  interconnected.  Each  of  these  elements  sends 
excitatory  and  inhibitory  signals  to  other  units  that  in  turn  update  their  behaviors  based  on  these 
received  messages.  ANNs  emulate  the  functionality  of  the  human  brain  that  implicitly  stores 
knowledge  in  the  interconnection  weights,  and  not  in  the  neurons  themselves  [9].  A  typical 
network  consists  of  a  set  of  processing  elements  (PE)  grouped  hierarchically  in  layers  and 
interconnected  in  some  fashion.  These  PEs  sum  N  weighted  inputs  and  transfer  that  outcome 
through  a  mathematical  function.  One  of  the?  m$st  popular  neural  paradigms  is  the  multilayer 
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perceptron  trained  using  the  Error  Back  Propagation  Training  Algorithm  (EBPTA)  [6]. 

Neural  network  models  have  key  advantages  over  rule-based  systems.  First,  the  learning 
process,  or  the  knowledge  acquisition  process,  takes  place  by  presenting  only  sample  machining 
cases  to  the  network.  Learning  occurs  through  modifying  the  connection  weights  between 
network  elements  (by  a  learning  algorithm)  to  model  the  representation  of  the  cases  [10]. 
Knowledge  is  thus  stored  implicitly  in  the  final  interconnections  values.  Second,  because 
training  examples  need  not  be  exhaustive,  it  is  conceivable  that  neural  network  methodology 
could  ease  the  knowledge  acquisition  bottleneck  that  is  hampering  the  creation  of  rule-based 
systems  [11].  Finally,  when  utilized  as  a  decision  processor,  an  artificial  neural  has  inherent 
parallelism  resulting  in  fast  computation,  robustness  or  error  resistance,  and  adaptiveness  or 
generalization. 

A  NEW  KNOWLEDGE  ACQUISITION  METHODOLOGY 

The  adopted  research  methodology  is  aimed  at  investigating  the  utilization  of  a  neural- 
based  knowledge  acquisition  approach.  A  neural-based  process  planning  system  is  a  knowledge- 
based  system,  where  knowledge  is  stored  implicitly  in  the  network  parameters  during  the  course 
of  learning.  As  described  earlier,  learning  takes  place  gradually  when  segmented  pieces  of 
knowledge  (i.e.  training  examples)  are  presented  to  die  network  and  accumulated  in  its  memory. 
Based  on  this  operational  mode  of  neural  systems,  and  taking  advantage  of  our  knowledge  and 
experience  in  developing  rule-based  systems  for  similar  tasks  (i.e.  milling  assistant  [3]).  This 
methodology  should  satisfy  a  number  of  basic  requirements.  First,  it  must  reduce  the  burden 
associated  with  developing  rule-based  systems  by  avoiding  constructing  general  rules,  because, 
as  explained  earlier,  experts  are  not  good  at  generalizing  solutions.  Second,  this  methodology 
must  ensure  that  knowledge  be  cast  in  a  form  usable  by  neural  systems;  forms  of  rules  are 
excluded.  What  is  required  by  a  neural  system  is  a  set  of  training  examples,  where  each 
example  represents  a  real  machining  case  described  by  a  vector  of  numeric  values  of  process 
variables  encoded  in  a  convenient  from.  Third,  consequently,  the  new  methodology  must 
employ  effective  instrumental  knowledge  gathering  forms;  and  since  no  general  inferential 
statements  are  required  at  this  stage,  the  knowledge  gathering  forms  can  be  mere  data  collection 
forms.  The  basic  idea  underlying  this  procedure  is  to  enable  the  user  provide  relevant  training 
examples  only,  and  the  neural  network,  through  the  learning  algorithm,  will  implicitly  derive 
the  IF-THEN  rules. 

First,  a  neural  network  paradigm  must  be  selected  in  consensus  with  the  application  at 
hand.  In  this  research,  a  supervised  neural  network  model  is  employed  as  a  classifier.  A 
supervised  learning  model  is  selected  as  it  enables  the  expert  to  express  his  judgmental  opinion 
in  a  relative  manner.  This  feature  is  very  essential  because,  in  this  application,  it  is  not  required 
to  group  the  training  examples  into  clusters  as  would  be  the  outcome  if  an  unsupervised  learning 
model  were  used.  Instead,  every  single  training  example  be  explicitly  identified  (by  the  expert) 
as  belonging  to  a  specific  class  that  represents  a  machining  decision.  The  neural  model  is 
employed  as  a  classifier  so  that  the  output  decision  belongs  to  a  class  of  objects  (e.g.  type  tool 
material:  HSS,  Carbide,  Ceramic),  or  to  a  ranges  of  numeric  values  (e.g.  tool  size:  1/4M/2", 
l/2"-l",...).  This  method  helps  considering  the  variety  of  machining  choices  that  can  satisfy 
the  requirements  of  a  quality  product.  1-51 
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The  second  decision  is  indeed  part  of  the  knowledge  acquisition  procedure.  It  is 
concerned  with  selecting  appropriate  process  planning  decisions  parameters  that  can  be 
adequately  handled  by  the  described  neural  model.  The  parameters  chosen  for  the  purpose  of 
tins  study  served  as  real  illustrative  examples  of  the  new  methodology  implementation.  These 
functions  are  chosen  to  illustrate  our  methodology  because  the  output  of  each  function  belongs 

practice^  *  fmit6  number  of  weU'defmed  output  classes  that  are  generally  encountered  in 

.  r  °n  **  °Utput  PP  decision  variables  to  be  automated,  the  next  step  was 

to  find  all  related  PP  input  variables  that  could  affect  the  selection  of  the  output  variable.  This 
process  took  place  in  an  iterative  manner  as  follows! 

Initially,  a  long  list  of  possible  input  variables  was  constructed  that  could  collectively 
drive  the  selection  process  of  all  output  variables  simultaneously. 

It  was  found  that  not  all  selected  input  variables  do  contribute  to  the  selection  of  a 
gl^nf°.utpu!  decision  value.  Thus,  to  avoid  confusion  among  various  inputs  and  outputs,  each 
output  decision  variable  and  its  inputs  were  filtered  out  and  handled  separately.  Four  individual 
forms  were  then  designed,  one  each  for  output  decision. 

Working  with  individual  forms,  the  meanings  of  variables  needed  further  explanation 
This  was  an  important  step  as  it  a  created  jargon  of  well-defined  terms  that  were  conveniently 
used  by  experts.  This  revisited  understanding  revealed  that  some  of  the  input  variables  had  to 
be  deleted  as  they  did  not  affect  selecting  the  value  of  the  output  parameter.  At  this  point,  our 
forms  were  judged  to  provide  ’good’  instruments  to  start  collecting  data.  Each  form  deals  with 
one  output  variables,  and  only  relevant  non-redundant  input  variables. 

During  data  collection,  our  expert  machinists  tended  to  emphasize  some  input  variables 

S g-1Ving  l6SS  WCight  t0  ^  ^  variables  when  de^ing  with  other  cases. 
7?  T?  .77s  distinction  into  account,  a  new  column  was  added  to  all  data  collection  forms  to 
identify  the  weight  or  the  importance  of  an  input  variable  at  a  particular  instant.  Weights  were 
assigned  to  variables  according  to  0-10  scale  in  an  increasing  order  of  merit. 

.  -  .^sing  the  data  forms,  a  good  number  of  examples  were  constructed  (about  50)  for 

output  decision.  A!,tbls  stage> il  was  observed  that  some  input  variables  constantly  scored  low 
weight  rating  (e.g.  2,  3).  Those  variables  were  deleted  form  the  forms  as  they  were  considered 
to  represent  noise.  This  also  reduced  the  dimensionality  of  the  input  vector. 

fa  light  of  the  above  requirements,  a  neural-based  machining  knowledge  acquisition 
technique  is  designed  and  implemented  during  the  course  of  developing  a  neural  model  as  a 
supportive  decision  making  tool  in  machining  process  planning. 

CONCLUSIONS 

cm  11  Computeriz^1  machining  processes  are  slowly  becoming  commonplace  in  both  large  and 
small  companies.  Current  research  into  knowtedg*  based  process  planning  has  gained  limited 
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acceptance  in  shops  because  of  the  complexity  and  shortcomings  of  such  systems.  One  problem 
is  that  the  artificial  intelligence  technique  often  becomes  the  focus  of  academic  research  projects. 
The  complexity  of  AI  and  computer  science  can  severely  limit  a  systems’  applicability  in 
production  environments.  Machine  operators  and  programmers  must  be  included  in  the  design 
and  development  of  such  applications.  Many  systems  are  dependent  on  a  skilled  computer 
programmer  to  tune  the  knowledge  and  rules  embedded  in,  what  is  often,  very  cryptic  code. 
The  transfer  of  knowledge  between  the  process  engineer  and  the  programmer  can  be  inefficient 
and  frustrating.  Creative  inference  techniques  and  clever  knowledge  structures  cannot  solve 
problems  that  are  not  well  understood  by  the  applications’  designers. 

The  tradeoffs  between  CAD/CAM  and  traditional  NC  programming  languages  (i.e., 
MDI,  APT,  etc.)  remains  a  heated  topic  among  manufacturers.  The  inability  of  software  to 
reliably  control  production  processes  has  led  many  to  always  have  a  safety  net  of  "the  old  way." 
The  focus  is  often  taken  off  the  overall  process  and  put  on  the  machinist  who  is  planning, 
verifying,  and  executing  the  programs. 
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SUMMARY 


This  report  documents  the  work  performed  on  the  project  “Adaptive  Control  of  Man¬ 
ufacturing  Processes”  for  the  Army  Research  Office.  The  primary  technical  objectives  of 
the  project  were  as  follows: 

1. )  Develop,  implement,  and  experimentally  verify  adaptive  controllers  for  unconstrained 

and  constrained  robotic  systems. 

2. )  Develop,  implement,  and  experimentally  verify  adaptive  estimation  and  control  algo¬ 

rithms  (including  neural  networks)  for  other  important  manufacturing  systems.  Prob¬ 
lems  of  interest  include  the  real-time  estimation  of  machine  tool  wear  using  cutting- 
force  measurements,  the  identification  and  compensation  of  thermal  effects  in  machin¬ 
ing,  and  the  control  of  cutting  force  in  machining. 

The  report  is  composed  of  reprints  of  ten  recent  and  representative  papers  which  describe 
aspects  of  the  research.  A  full  list  of  the  project  publications  is  given  below;  those  included 
in  the  report  are  marked  with  an  asterisk.  Additional  information  regarding  these  papers 
or  the  project  will  be  provided  by  the  investigators  upon  request. 

PROJECT  PUBLICATIONS 
1.  Journal  Articles  and  Book  Chapters 

*  Colbaugh,  R.  and  K.  Glass,  “Decentralized  Adaptive  Control  of  Nonholonomic  Me¬ 
chanical  Systems”  (submitted  for  publication) 

Colbaugh,  R.  and  E.  Barany,  “Adaptive  Control  of  Nonholonomic  Robotic  Systems” 
(submitted  for  publication) 

Colbaugh,  R.  and  K.  Glass,  “Adaptive  Task-Space  Control  of  Flexible- Joint  Manipu¬ 
lators”  (submitted  for  publication) 

*  Colbaugh,  R.  and  K.  Glass,  “Adaptive  Compliant  Motion  Control  of  Electrically- 
Driven  Manipulators”  (submitted  for  publication) 

Colbaugh,  R.  and  K.  Glass,  “A daptiv^_ Compliant  Motion  Control  of  Manipulators 


Without  Velocity  Measurements”  (submitted  for  publication) 

Colbaugh,  R.  and  K.  Glass,  “Adaptive  Tracking  Control  of  Manipulators  Using  Only 
Position  Measurements”  (submitted  for  publication) 

Wedeward,  K.  and  R.  Colbaugh,  “Adaptive  Controllers  Which  Employ  Saturated  Error 
Update  Laws  for  Robot  Trajectory  Tracking”  (submitted  for  publication) 

Colbaugh,  R.  and  K.  Glass,  “Adaptive  Stabilization  of  Mechanical  Systems  Using  Only 
Configuration  Measurements”  (submitted  for  publication) 

Barany,  E.  and  R.  Colbaugh,  “Adaptive  Stabilization  of  Mechanical  Systems”  (sub¬ 
mitted  for  publication) 

Barany,  E.  and  R.  Colbaugh,  “Adaptive  Control  of  Chaotic  Dynamical  Systems”  (sub¬ 
mitted  for  publication) 

Flachs,  S.  and  R.  Colbaugh,  “Neural  Network  Control  of  Thermal  Expansion”  (sub¬ 
mitted  for  publication) 

Colbaugh,  R.,  K.  Glass,  and  E.  Barany,  “Adaptive  Regulation  of  Manipulators  Using 
Only  Position  Measurements”,  International  Journal  of  Robotics  Research ,  Vol.  15, 
1996  (in  press) 

Seraji,  H.  and  R.  Colbaugh,  “Force  Tracking  in  Impedance  Control”,  International 
Journal  of  Robotics  Research ,  Vol.  15,  1996  (in  press) 

Colbaugh,  R.  and  K.  Glass,  “Decentralized  Adaptive  Control  of  Electric  ally- Driven 
Manipulators”,  Journal  of  Intelligent  Control  and  Systems,  Vol.  10,  1996  (in  press) 

Colbaugh,  R.,  E.  Barany,  and  K.  Glass,  “Adaptive  Control  of  Constrained  Robotic 
Systems  for  Waste  Management  Applications”,  International  Journal  of  Environmen¬ 
tally  Conscious  Design  and  Manufacturing ,  Vol.  5,  1996  (in  press) 

Colbaugh,  R.,  K.  Wedeward,  and  A.  Engelmann,  “Adaptive  Compliant  Motion  Control 
of  Manipulators:  Theory  and  Experiments”,  Journal  of  Robotic  Systems ,  Vol.  13,  1996 
(in  press) 

Colbaugh,  R.  and  K.  Glass,  “A  Robust  Approach  to  Real-Time  Tool  Wear  Estimation”, 
International  Journal  of  Robotics  and  Subornation,  Vol.  11,  No.  1,  1996  (in  press) 


Seraji,  H.,  R.  Colbaugh,  and  K.  Glass,  “Control  of  a  Serpentine  Robot  for  Inspection 
Tasks”,  NASA  Tech  Briefs  Journal ,  Vol.  20,  1996  (in  press) 

Colbaugh,  R.,  K.  Glass,  and  E.  Barany,  “A  Mechatronic  Systems  Approach  to  Con¬ 
trolling  Robotic  Systems  With  Actuator  Dynamics”,  invited  chapter  in  Mechatronic 
System  Techniques  and  Applications ,  Gordon  and  Breach  International  Series  in  En¬ 
gineering,  Technology,  and  Applied  Science,  1996 

Colbaugh,  R.,  K.  Glass,  and  E.  Barany,  “Adaptive  Stabilization  and  Tracking  Control 
of  Electrically-Driven  Manipulators”,  Journal  of  Robotic  Systems ,  Vol.  13,  No.  4,  1996 

Colbaugh,  R.,  K.  Wedeward,  K.  Glass,  and  H.  Seraji,  “New  Results  on  Adaptive 
Impedance  Control  for  Dexterous  Manipulators”,  International  Journal  of  Robotics 
and  Automation ,  Vol.  11,  No.  1,  1996 

Colbaugh,  R.,  K.  Glass,  and  H.  Seraji,  “Performance-Based  Adaptive  Tracking  Control 
of  Robot  Manipulators”,  Journal  of  Robotic  Systems,  Vol.  12,  No.  8,  1995 

Colbaugh,  R.,  H.  Seraji,  and  K.  Glass,  “Adaptive  Compliant  Motion  Control  for  Dex¬ 
terous  Manipulators”,  International  Journal  of  Robotics  Research ,  Vol.  14,  No.  3, 
1995 

Glass,  K.,  R.  Colbaugh,  D.  Lim,  and  H.  Seraji,  “Real-Time  Collision  Avoidance  for 
Redundant  Manipulators”,  IEEE  Transactions  on  Robotics  and  Automation,  Vol.  11, 
No.  3,  1995 

Colbaugh,  R.  and  K.  Glass,  “Decentralized  Adaptive  Control  of  Mechanical  Systems”, 
Intelligent  Automation  and  Soft  Computing:  An  International  Journal,  Vol.  1,  No.  3, 
1995 

Colbaugh,  R.  and  K.  Glass,  “Robust  Adaptive  Control  of  Redundant  Manipulators”, 
Journal  of  Intelligent  and  Robotic  Systems,  Vol.  14,  No.  1,  1995 

Colbaugh,  R.  and  K.  Glass,  “Decentralized  Adaptive  Compliance  Control  of  Robot 
Manipulators”,  Robotica,  Vol.  13,  No.  5,  1995 

Colbaugh,  R.,  “Stability  Analysis  for  a  Class  of  Neural  Networks”,  International  Jour¬ 
nal  of  Robotics  and  Automation,  Vol.  10,  Vol.  4,  1995 

Colbaugh,  R.,  K.  Glass,  and  H.  Seraji,  ^‘Rbbust  Adaptive  Control  of  Lagrangian  Sys- 


terns”,  Computers  and  Electrical  Engineering:  An  International  Journal ,  Vol.  21,  No. 
4,  1995 

Colbaugh,  R.,  H.  Seraji,  and  K.  Glass,  “A  New  Class  of  Adaptive  Controllers  for  Robot 
Trajectory  Tracking”,  Journal  of  Robotic  Systems ,  Vol.  11,  No.  8,  1994 

Colbaugh,  R.,  H.  Seraji,  and  K.  Glass,  “Decentralized  Adaptive  Control  of  Manipula¬ 
tors”,  Journal  of  Robotic  Systems,  Vol.  11,  No.  5,  1994 

Colbaugh,  R.,  K.  Glass,  and  M.  Jamshidi,  “Advanced  Controller  Development  for 
Robotic  Waste  Management”,  International  Journal  of  Environmentally  Conscious 
Design  and  Manufacturing ,  Vol.  3,  No.  3-4,  1994 

Colbaugh,  R.,  “Adaptive  Point-to-Point  Motion  Control  of  Manipulators”,  Interna¬ 
tional  Journal  of  Robotics  and  Automation ,  Vol.  9,  No.  2,  1994 

Colbaugh,  R.,  K.  Glass,  and  P.  Pittman,  “Adaptive  Control  for  a  Class  of  Hamiltonian 
Systems”,  Computers  and  Electrical  Engineering:  An  International  Journal ,  Vol.  20, 
No.  1,  1994 

Colbaugh,  R.,  K.  Glass  and  H.  Seraji,  “Direct  Adaptive  Control  of  Robotic  Systems”, 
Journal  of  Intelligent  and  Robotic  Systems ,  Vol.  9,  No.  1,  1994 

Colbaugh,  R.  and  K.  Glass,  “Adaptive  Control  of  Manipulators  Handling  Hazardous 
Waste”,  Radioactive  Waste  Management  and  Environmental  Restoration,  Vol.  18,  No. 
1,  1994 

Colbaugh,  R.,  J.  Baca,  and  M.  Jamshidi,  “Decentralized  Adaptive  Compliance  Con¬ 
trol  of  Manipulators  for  Waste  Management  Applications”,  Chapter  6,  Robotics  and 
Remote  Systems  in  Hazardous  Environments ,  Prentice  Hall,  1994 

Akbarzadeh,  M.,  K.  Kumbla,  M.  Jamshidi,  and  R.  Colbaugh,  “Intelligent  Control 
of  Flexible  and  Redundant  Robots”,  Chapter  5,  Robotics  and  Remote  Systems  in 
Hazardous  Environments,  Prentice  Hall,  1994 


2.  Refereed  and  Invited  Conference  Publications 


Colbaugh,  R.  and  K.  Glass,  “Decentralize^  Adaptive  Stabilization  and  Tracking  Con- 


trol  of  Electrically-Driven  Manipulators” ,  Proc.  1996  IFAC  Triennial  World  Congress , 
San  Francisco,  CA,  July  1996 

Glass,  K.  and  R.  Colbaugh,  “Sensor-Based  Coordinated  Motion  Control  of  Redundant 
Manipulators”,  Proc.  Sixth  International  Symposium  on  Robotics  and  Manufacturing , 
Montpellier,  France,  May  1996  (invited  paper) 

Colbaugh,  R.  and  K.  Glass,  “Robust  Adaptive  Regulation  of  Manipulators:  Theory 
and  Experiments”,  Proc.  1996  IEEE  International  Conference  on  Robotics  and  Au¬ 
tomation ,  Minneapolis,  MN,  April  1996 

Colbaugh,  R.  and  K.  Glass,  “Adaptive  Compliant  Motion  Control  of  Manipulators 
Without  Velocity  Measurements”,  Proc.  1996  IEEE  International  Conference  on 
Robotics  and  Automation ,  Minneapolis,  MN,  April  1996 

*  Glass,  K.  and  R.  Colbaugh,  “Real-Time  Tool  Wear  Estimation  Using  Cutting  Force 
Measurements”,  Proc.  1996  IEEE  International  Conference  on  Robotics  and  Automa¬ 
tion ,  Minneapolis,  MN,  April  1996 

Colbaugh,  R.  and  K.  Glass,  “Adaptive  Tracking  Control  of  Manipulators  Using  Only 
Position  Measurements”,  Proc.  34th  IEEE  Conference  on  Decision  and  Control ,  New 
Orleans,  LA,  December  1995 

Wedeward,  K.  and  R.  Colbaugh,  “Adaptive  Controllers  Which  Employ  Saturated  Er¬ 
ror  Update  Laws  for  Robot  Trajectory  Tracking”,  Proc.  34th  IEEE  Conference  on 
Decision  and  Control,  New  Orleans,  LA,  December  1995 

Colbaugh,  R.  and  K.  Glass,  “Adaptive  Control  of  Electrically-Driven  Manipulators: 
Theory  and  Experiments”,  Proc.  1995  IEEE  Conference  on  Control  Applications, 
Albany,  NY,  September  1995 

Wedeward,  K.  and  R.  Colbaugh,  “Adaptive  Position/Force  Regulation  Without  Rate 
Information”,  Proc.  1995  IEEE  Conference  on  Control  Applications,  Albany,  NY, 
September  1995 

Colbaugh,  R.  and  K.  Glass,  “Adaptive  Tracking  Control  of  Rigid-Link  Electrically- 
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Abstract 

This  paper  considers  the  problem  of  controlling  nonholonomic  mechanical  systems 
in  the  presence  of  incomplete  information  concerning  the  system  model  and  state,  and 
presents  a  class  of  decentralized  adaptive  controllers  as  a  solution  to  this  problem.  The 
proposed  control  strategies  provide  simple  and  robust  solutions  to  a  number  of  important 
nonholonomic  system  control  problems,  including  stabilization  to  an  equilibrium  manifold, 
stabilization  to  an  equilibrium  point,  and  trajectory  tracking  control.  All  of  the  schemes 
are  computationally  efficient,  are  implementable  without  system  dynamic  model  or  rate 
information,  and  ensure  uniform  boundedness  of  all  signals  and  accurate  motion  control. 
Computer  simulation  results  axe  provided  to  complement  the  theoretical  developments. 

1.  Introduction 

There  is  great  theoretical  and  practical  interest  in  controlling  mechanical  systems  in 
the  presence  of  measurement  and  model  uncertainty.  It  is  well-known  that  control  schemes 
which  require  precise  knowledge  of  the  complete  system  model  to  provide  good  performance 
suffer  from  many  serious  limitations.  Additionally,  it  is  usually  unrealistic  to  assume 
that  the  entire  system  state  can  be  accurately  measured  and  used  for  feedback;  indeed, 
although  high-precision  sensors  are  available  for  measuring  the  configuration  of  mechanical 
systems,  rate  measurements  are  typically  either  contaminated  with  noise  or  not  available 
at  all.  Recognition  of  these  facts  has  motivated  considerable  interest  in  developing  control 
methods  which  can  be  successfully  implemented  despite  model  and  state  uncertainties. 
This  challenging  problem  becomes  even  more  difficult  in  the  important  case  in  which 
the  system  is  subjected  to  holonomic  or  nonholonomic  (nonintegrable)  constraints  on  its 
kinematics. 

Nonholonomically  constrained  mechanical  systems,  in  particular,  have  attracted  sig¬ 
nificant  attention  recently.  Much  of  this  interest  is  a  consequence  of  the  importance  of 
such  systems  in  applications.  For  example,  nonholonomic  constraints  arise  in  systems 
with  rolling  contact,  such  as  wheeled  vehi<2lesl%nd  multifingered  robotic  hands,  and  in  sys- 
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terns  for  which  the  dynamics  admits  a  symmetry,  such  as  space  structures  with  angular 
momentum  conservation;  observe  that  a  wide  variety  of  systems  of  practical  importance 
are  constrained  in  this  way.  Interest  in  studying  nonholonomic  systems  is  also  motivated 
by  their  role  as  a  class  of  “strongly”  nonlinear  systems  for  which  traditional  nonlinear 
control  methods  are  insufficient  and  new  approaches  must  be  developed.  For  instance,  the 
investigation  of  these  systems  has  led  to  recent  progress  in  geometric  control  theory  and 
geometric  mechanics. 

Most  of  the  work  reported  to  date  on  controlling  nonholonomic  mechanical  systems  has 
focused  on  the  steering  or  trajectory  generation  problem;  the  interested  reader  is  referred  to 
[1,2]  for  a  survey  of  some  of  this  work  and  to  [3-9]  for  an  indication  of  recent  progress.  This 
research  has  produced  numerous  interesting  solutions  to  the  trajectory  generation  problem, 
and  has  led  to  advances  in  nonlinear  control  theory  and  practice.  However,  the  work  has 
proceeded  by  considering  the  system  velocities  to  be  the  control  inputs  and  working  with 
the  system  kinematic  model,  thereby  addressing  the  problem  at  the  kinematic  control  level 
and  ignoring  the  mechanical  system  dynamics. 

There  are  important  reasons  for  formulating  the  nonholonomic  system  control  prob¬ 
lem  at  the  dynamic  control  level,  where  the  control  inputs  are  those  produced  by  the 
system  actuators  and  the  system  model  contains  the  mechanical  system  dynamics.  For 
example,  since  this  is  the  level  at  which  control  actually  takes  place  in  practice,  designing 
controllers  at  this  level  can  lead  to  improved  implementability  and  can  help  in  the  early 
identification  and  resolution  of  difficulties.  It  is  interesting  to  consider  the  problem  of 
controlling  holonomically  constrained  robots  in  this  regard:  the  kinematic  control  problem 
in  this  case  is  trivial,  but  the  dynamic  control  problem  is  often  quite  challenging  [e.g.,  10]. 
Another  motivation  for  considering  the  dynamic  control  problem  is  that  many  nonholo¬ 
nomic  systems  are  most  naturally  studied  at  this  level,  so  that  such  an  approach  broadens 
the  scope  of  potential  applications  [e.g.,  11].  Recognizing  the  importance  of  addressing 
the  nonholonomic  system  control  problem  at  the  dynamic  control  level,  some  researchers 
have  considered  this  problem  in  recent  years  [e.g.,  2,  12-17].  Significant  progress  has  been 
made  in  understanding  the  fundamental  characteristics  of  these  systems,  and  several  use¬ 
ful  dynamic  controllers  have  been  proposed.  However,  virtually  all  of  the  dynamic  control 
strategies  proposed  to  date  have  been  developed  by  assuming  that  the  full  dynamic  model 
is  precisely  known  and  that  the  entire  system-state  is  measurable. 

This  paper  considers  the  dynamic  control  of  uncertain  nonholonomic  mechanical  sys¬ 
tems,  and  proposes  that  the  problem  be  studied  within  an  adaptive  control  framework. 
The  majority  of  the  research  in  adaptive  control  of  mechanical  systems  has  proceeded  by 
assuming  that  the  uncertainty  in  the  system  model  can  be  linearly  parameterized  using 
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constant  coefficients  [e.g.,  18-21].  This  assumption  permits  the  design  of  controllers  for 
which  system  stability  can  be  ensured,  but  it  can  be  limiting.  For  example,  controllers 
developed  using  this  approach  are  typically  very  complex,  require  full  state  measurement, 
are  not  easily  implementable  in  terms  of  arbitrary  generalized  coordinates,  and  have  ex¬ 
hibited  poor  robustness  and  transient  performance.  These  characteristics  suggest  that  this 
methodology  may  not  be  well  suited  for  applications  involving  nonholonomic  systems.  Fur¬ 
thermore,  this  formulation  leads  inevitably  to  centralized  algorithms,  in  which  the  control 
input  for  each  configuration  degree-of-freedom  (DOF)  depends  on  all  of  the  other  DOF; 
this  is  a  consequence  of  the  need  to  incorporate  the  system  dynamic  model  directly  into  the 
control  law.  In  contrast  with  this  approach  to  adaptive  robot  control,  the  present  schemes 
are  developed  using  the  recently  proposed  performance-based  adaptive  control  methodol¬ 
ogy,  in  which  the  adaptive  laws  adjust  the  controller  gains  based  on  system  performance 
rather  than  on  knowledge  of  the  manipulator  model  [22-25].  As  a  consequence,  it  is  possible 
to  design  the  schemes  so  that  they  are  computationally  efficient,  ensure  good  robustness 
and  transient  performance,  and  are  implementable  in  terms  of  any  set  of  generalized  coor¬ 
dinates  without  rate  measurements  or  information  regarding  the  system  dynamic  model. 
Additionally,  the  controllers  can  be  given  a  simple  decentralized  structure,  in  which  the 
control  input  for  each  configuration  DOF  is  computed  based  on  information  concerning 
only  that  DOF.  Note  that  decentralized  control  offers  significant  potential  benefits  relative 
to  centralized  control,  including  enhanced  computational  simplicity,  ease  of  implemen¬ 
tation,  increased  robustness  and  fault  tolerance,  and  reduced  reliance  on  system  model 
information.  The  control  strategies  proposed  in  this  paper  provide  solutions  to  a  number 
of  important  nonholonomic  system  stabilization  and  trajectory  tracking  problems,  and  all 
are  shown  to  ensure  uniform  boundedness  of  all  signals  and  accurate  motion  control.  The 
efficacy  of  the  proposed  approach  is  illustrated  through  computer  simulations  with  two 
nonholonomic  mechanical  systems:  a  mobile  robot  with  rolling  constraints  and  a  planar 
mechanism  for  which  angular  momentum  is  conserved. 

The  paper  is  organized  in  the  following  manner.  Some  preliminary  facts  of  relevance 
to  the  nonholonomic  system  control  problem  are  established  in  Section  2.  In  Sections  3 
through  5  the  decentralized  adaptive  stabilization  and  tracking  schemes  are  presented  and 
analyzed  and  the  performance  of  the  proposed  controllers  is  illustrated  through  computer 
simulation.  Finally,  Section  6  summarizes-  the  paper  and  provides  some  suggestions  for 
future  work. 
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2.  Preliminaries 

Consider  the  class  of  nonholonomic  mechanical  systems  described  by  the  equations 

[e-g-j  12] 


M(x)T  =  J?(x)x  +  Vcc(x,  x)x  +  G(x)  +  AT(x)A  (la) 

A(x)x  =  0  (15) 

where  x  €  3Jn  is  the  vector  of  system  generalized  coordinates,  T  £  is  the  vector 

of  actuator  inputs,  M  :  $tn  —>  3£nxp)  is  bounded  and  of  full  rank,  H  :  3£n  — >  3?nXn 

is  the  system  inertia  matrix,  Vcc  :  x  3}nXn  quantifies  Coriolis  and  centripetal 

acceleration  effects,  G  :  arises  from  the  system  potential  energy,  A  :  3tn  — >  3?mXn 

is  a  bounded  full  rank  matrix  quantifying  the  nonholonomic  constraints,  A  6  is  the 
vector  of  constraint  multipliers,  and  all  functions  axe  assumed  to  be  smooth.  We  note 
that  a  coordinate-free  formulation  could  be  given  in  terms  of  a  mechanical  control  system 
on  the  tangent  bundle  of  a  smooth  manifold  [e.g.,  26],  in  which  case  (1)  would  represent 
the  system  in  local  coordinates;  however,  working  directly  with  the  model  (1)  is  sufficient 
for  the  present  study.  The  basic  problem  to  be  considered  in  this  investigation  is  that 
of  specifying  the  control  input  T,  using  only  measurements  of  the  system  configuration 
x  and  with  no  knowledge  of  the  system  dynamic  model  (la),  so  that  a  particular  control 
objective  is  achieved.  For  example,  a  fundamental  problem  is  the  stabilization  of  the 
system  configuration  to  some  goal  x^.  It  is  well-known  that  the  presence  of  nonholonomic 
constraints  on  the  system  kinematics  complicates  the  control  problem  considerably.  Indeed, 
it  can  be  shown  that  for  nonholonomic  systems  even  the  basic  stabilization  problem  cannot 
be  solved  using  continuous  static  state  feedback  [27,12],  These  difficulties  are  only  increased 
in  the  case  of  control  in  the  presence  of  model  and  state  uncertainty. 

It  is  well-known  that  the  mechanical  system  dynamics  (1)  possesses  considerable  struc¬ 
ture.  For  example,  for  any  set  of  generalized  coordinates  x,  the  matrix  H  is  symmetric 
and  positive-definite,  the  matrix  Vcc  depends  linearly  on  x,  and  the  matrices  H  and  Vcc 
are  related  according  to  H  -  Vcc  +  V?  [e.g.,  28].  Additionally,  we  will  assume  in  what 
follows  that  the  inertia  matrix  H  and  potential  energy  gradient  G  are  bounded  functions 
with  bounded  first  partial  derivatives;  this  latter  property  holds  for  virtually  all  mechanical 
systems  of  interest  in  applications,  such  as  robotic  systems  and  vehicles.  In  this  case  it  is 
easy  to  show  that  Vcc  is  bounded  in  x,  that  Vcc(x,  x)y  =  Vcc(x,y)x  Vy,  and  that  if  y  and 
y  are  bounded  then  Vcc(x,y)  is  bounded  and  V'cc(x,y)  grows  linearly  with  x.  All  of  these 
properties  will  prove  useful  for  controller  development. 

Observe  that  the  rows  of  A ,  say  a,  £  3?1  x n  for  i  —  1, 2, ...,  m,  are  smooth  covectors  on 
the  configuration  space  &n,  and  that  the  fe&mption  that  A  is  full  rank  implies  that  the 
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codistribution  spanned  by  the  a;  has  dimension  m.  The  annihilator  of  this  codistribution  is 
an  r  =  n  —  m  dimensional  smooth  distribution  A  =  span[ri(x),  r2(x),  rr(x)],  where  the 

r,  are  smooth  vector  fields  on  the  configuration  space  which  satisfy  Ar^  =  0  Vx.  Defining 
R  =  [ri,r2,..., rr]  €  3£nXr  permits  this  relationship  to  be  expressed  more  concisely  as 
AR  =  0.  As  an  example,  let  the  matrix  A  be  partitioned  as  A  =  [A\  A2],  with  A\  €  3?mXm 
and  A2  €  3JmXr  and  where  A\  is  nonsingular  (this  is  always  possible,  possibly  with  a 
reordering  of  the  configuration  coordinates).  Then  R  can  be  constructed  as  follows: 


R  = 


-Aj  1A2 

It 


(2) 


where  IT  is  the  rxr  identity  matrix.  Consider  now  the  involutive  closure  of  A,  denoted  A* 
and  defined  as  the  smallest  involutive  distribution  containing  A.  We  will  assume  in  what 
follows  that  A*  has  constant  rank  n  on  the  configuration  space.  In  this  case,  Frobenius’ 
theorem  [e.g.,  26]  shows  that  the  constraints  axe  nonintegrable,  so  that  the  system  (1)  is 
(completely)  nonholonomic  and  there  is  no  explicit  constraint  on  the  configuration  space. 
We  note  that  no  generality  is  lost  with  this  assumption  because,  if  the  dimension  of  A*  is 
m*  <  n,  then  n  —  m*  configuration  coordinates  are  redundant  and  could  be  eliminated, 
yielding  a  (completely)  nonholonomic  system. 

It  is  often  desirable  from  a  system  control  perspective  to  eliminate  the  constraint 
multiplier  A  from  the  system  description.  One  means  of  doing  this  is  to  first  define  a 
partition  of  x  corresponding  to  the  partition  specified  for  A,  so  that  x  =  [x^  x^]T  with 
Xi  €  and  X2  €  3?r.  Observe  that  the  definition  (2)  and  the  constraint  equation 
(lb)  imply  that  the  system  velocities  are  parameterized  by  x2  via  x  =  R(x)x2.  This 
parameterization  then  permits  (1)  to  be  reformulated  as 

RtMT  =  RtHRx2  +  Rt(HR  +  VccR)x2  +  RtG 
A(x)x  =  0 


or 


F  =  H*(x)x2  +  v;c(x,  x2)x2  +  G*(x)  (3a) 

xi  =  A*(x)x2  __  _  (36) 

where  F  =  RtMT,  H*  =  RTHR,  V*c  =  RT(HR+VCCR ),  G*  =  RT G,  and  A*  =  -Af 1 A2. 
Note  that  (3)  is  a  (2n  —  m)  order  differential  equation  which  defines  the  evolution  of  the 
2 n  —  m  system  states  (x,x2).  In  what  follows,  it  is  assumed  that  p>  r  and  RT M  is  full 
rank,  so  that  any  desired  F  can  be  realized  tliftaugh  proper  specification  of  T  and  therefore 
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the  system  (3a)  is  fully  actuated.  The  following  lemma  characterizes  some  of  the  structure 
of  (3),  essentially  stating  that  (3)  inherits  the  “nice”  structure  of  (1). 

Lemma  1:  The  dynamic  model  terms  axe  bounded  functions  of  x  whose  time 

derivatives  H*,  G*  are  also  bounded  in  x  and  depend  linearly  on  X2,  the  matrix  H*  is 
symmetric  and  positive-definite,  and  the  matrices  H*  and  V*c  are  related  according  to 
H*  =  V*c  +  V*CT.  Additionally,  Vc*(x,x2)y  =  Vc*(x,y)x2  Vy,  and  if  y  and  y  are  bounded 
then  Fc*  (x,  y)  is  bounded  and  V^.*(x,y)  grows  linearly  with  x2. 

Proof:  All  of  the  properties  can  be  established  through  direct  calculation  using  the  defi¬ 
nitions  of  H*,  V*c,  and  G*  and  the  properties  of  H,  Fcc,  and  G.  ■ 

Each  of  the  control  systems  proposed  in  this  paper  consists  of  two  subsystems:  a 
decentralized  adaptive  scheme  that  provides  the  “fictitious”  control  input  F  required  to 
ensure  that  the  given  control  objective  is  realized,  and  an  algorithm  for  mapping  this 
“fictitious”  input  F  to  a  physically  realizable  control  input  T.  Given  the  assumptions 
regarding  the  matrix  RTM  there  is  always  a  T  corresponding  to  any  desired  input  F, 
and  indeed  it  is  straightforward  to  determine  this  T.  For  example,  if  p  =  r  then  T  is 
uniquely  determined  from  F  via  T  =  (RTM)~1  F  ,  and  if  p  >  r  then  T  can  be  chosen 
to  give  the  desired  F  and  simultaneously  achieve  some  additional  objective.  Therefore 
for  the  remainder  of  the  paper  it  is  assumed  that  F  can  be  commanded  directly,  and  the 
focus  of  the  subsequent  discussion  is  on  specifying  this  input  so  that  the  given  control 
objective  is  realized  in  the  presence  of  model  and  state  uncertainty.  In  this  paper  we  shall 
address  both  the  motion  stabilization  problem,  in  which  the  objective  is  to  ensure  that  the 
system  evolves  from  its  initial  state  to  some  desired  final  state,  and  the  trajectory  tracking 
problem,  in  which  the  evolution  from  initial  state  to  desired  final  state  is  to  be  along  some 
specified  trajectory.  In  what  follows,  it  is  assumed  that  x  (but  not  x)  is  measurable  and 
that  little  information  is  available  regarding  the  system  model. 

3.  Adaptive  Stabilization  to  an  Equilibrium  Manifold 

In  this  section,  we  begin  our  study  of  the  problem  of  stabilizing  the  nonholonomic 
system  (1)  in  the  presence  of  uncertainty.  Recall  that  the  nonholonomic  nature  of  the 
constraints  significantly  increases  the  difficulty  of  controlling  this  system,  so  that  even  the 
basic  problem  of  stabilizing  (1)  to  a  desired '(constant)  configuration  x^  is  not  solvable 
using  continuous  static  state  feedback  [27,12],  In  view  of  this  fact,  we  examine  first  the 
simpler  problem  of  stabilizing  (1)  to  an  “equilibrium  manifold”  of  dimension  m  rather  than 
to  an  equilibrium  point  (see,  e.g.,  [12]  for  a  definition  of  stabilizing  a  system  to  a  manifold). 
Note  that  this  problem  is  solvable  using  srpopth  static  feedback  and  can  be  viewed  as  the 
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natural  extension  of  work  on  stabilizing  holonomically  constrained  systems. 

3.1  Control  Scheme 

Let  us  define  an  equilibrium  manifold  Ne  as  follows  [12]: 

Ne  =  {x,  x  |  s(x)  =  0,  x  =  0} 

where  s  :  3?"  — *  3?r  is  a  smooth  “output”  function  which  can  be  specified  as  desired  subject 
to  the  (transversality)  condition  that  the  matrices  ds/dx2  and  (ds/dx)R  are  nonsingular. 
We  seek  a  control  law  which  stabilizes  (1)  to  the  smooth  m-dimensional  manifold  Ne  and 
which  is  implementable  without  any  knowledge  of  the  system  dynamic  model  and  using 
only  measurements  of  system  configuration.  For  simplicity  of  development  we  shall  choose 
s  =  X2  in  what  follows,  so  that  the  system  representation  (3)  can  be  used  directly.  However, 
a  system  model  analogous  to  (3)  can  be  derived  in  terms  of  any  other  output  s  which 
satisfies  the  transversality  condition,  so  that  the  controller  design  procedure  given  here 
can  be  applied  in  the  general  case  [29].  In  fact,  the  model  independence  of  the  proposed 
approach  to  system  stabilization  makes  the  controller  development  quite  modular  and  easy 
to  apply  for  any  choice  of  output. 

Consider  the  following  adaptive  scheme: 

Fi  =  fi(t )  +  h^Wi  +  k2'r2esi 

Wi  =  -27u>i  +  7  2isi  (4) 

fi  =  —<rfi  +  @qai  for  i  =  1, 2,  ...,n  —  m 

where  the  subscript  i  refers  to  the  zth  element  of  a  vector,  es  =  — s  is  a  measure  of  distance 
from  Ne,  qs  =  es  +  k2e3/kij— w/7,  f(t)  €  3Jr  is  the  adaptive  element,  ki,k2,~/  are  positive 
constants,  j3  is  the  adaptation  gain  for  f,  and  <7  is  defined  as  follows:  a  =  0  if  ||  f  ||<  fmax 
and  a  =  <jq  otherwise,  with  <To  and  fmax  positive  scalars.  The  scheme  (4)  can  be  viewed  as 
a  natural  extension  of  the  well-known  PD-plus-potential-energy  control  law  for  mechanical 
systems  [e.g.,  26],  with  the  velocity  feedback  component  of  the  controller  replaced  by  a 
first  order  linear  compensator  and  with  potential  energy  compensation  provided  through 
adaptation.  It  will  be  seen  below  that  w  provides  a  simple  means  of  injecting  damping  into 
the  closed-loop  system  without  using  velocity  feedback,  and  that  f  effectively  compensates 
for  the  potential  energy  effects  represented  by  G*.  The  definition  of  a  provides  a  “switching 
cr-modification”  adaptation  law  [30].  This  approach  to  adaptive  control  has  been  proposed 
for  linear  systems  to  provide  robust  performance  in  the  presence  of  external  disturbances 
while  retaining  asymptotic  convergence  of  tftfe  system  error  in  the  ideal  case  in  which  no 
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disturbances  axe  present  [30].  In  what  follows,  it  will  be  shown  that  the  formulation  (4) 
yields  a  similar  robust  adaptive  performance  for  the  problem  of  stabilizing  the  nonlinear 
mechanical  system  (1).  Observe  that  (4)  is  implementable  without  velocity  information 
because,  although  w  and  f  depend  on  es,  the  control  law  terms  w  and  f  depend  only  on 
es  (as  can  be  verified  through  direct  integration  of  the  w  and  f  equations). 

The  stability  properties  of  the  proposed  adaptive  regulation  strategy  (4)  are  summa¬ 
rized  in  the  following  theorem. 


Theorem  1:  If  7  is  chosen  sufficiently  large  then  the  adaptive  scheme  (4)  ensures  that 

(3)  evolves  with  e3,  es,  w,  and  f  (semiglobally)  uniformly  bounded  and  so  that: 

1.  If  the  system  potential  energy  Z7(x )  and  its  first  partials  are  constant  on  JVe  then  es 
and  es  converge  asymptotically  to  zero  (note  that  potential  energy  functions  of  this 
sort  are  very  common  in  applications). 

2.  If  the  system  potential  energy  t/(x)  varies  on  Ne  then  ea  and  es  converge  asymptoti¬ 
cally  to  a  compact  set  which  can  be  made  arbitrarily  small  (by  increasing  0). 

Proof:  We  consider  first  the  case  in  which  U(x)  and  its  first  partials  are  constant  on  Ne. 

Applying  the  control  law  (4)  to  the  mechanical  system  dynamics  (3)  yields  the  closed-loop 

error  dynamics 


(5) 


(6) 


H*es  +  V*ces  +  &i72w  +  k2'y2es  +  $  +  G*(0)  -  G*(x)  =  0 
where  $  =  f  —  G*(0).  Consider  the  Lyapunov  function  candidate 

Vl  =  +  ^272efea  +  ^iwrw  +  -^-ejH*e3 

-  iw TH*es  +  +  U(x)  -  U( 0)  +  G*r(0)es 

and  note  that  V\  is  a  positive-definite  and  radially-unbounded  function  of  es,  es,  w  and 
$  if  7  is  chosen  sufficiently  large  (for  example,  if  7  is  chosen  large  enough  then  the  term 
k2 72eJ’es/2  +  U{x)  -  U( 0)  +  G*T(0)es  can  be  shown  to  be  positive-definite  in  ea). 

Differentiating  (6)  along  (5)  and  simplifying  using  the  structural  properties  of  the 
mechanical  system  dynamics  summarized  in  Lemma  1  yields 

Vi  =  -(7  -  p-)eTH*e  -  Si 

h7  h 


e  ||2  —  ki^  ||  w  ||2  +2w TH*e 


1 


+  {T~V*ce  -  K>]  +  M  +  (G*(x)  -  G*(0))(-j-=-e 
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0 


2  \\  /  TT*\  II  1  II 2  Ml  k2M 


+  2Xm(H-)  ||  w 


M  „ 

+-  ||  w 


)  II  es 

kMc 

k\7 


k\7 

2  -kl7  ||  w  ||2 


-w) 
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(7) 


k 

|2  |  ^cc 

7 


w 


8 


where  Am(-),  Xm(-)  denote  the  minimum  and  maximum  eigenvalue  of  the  matrix  argu¬ 
ment,  respectively,  kcc  is  a  scalar  upper  bound  on  the  linear  dependency  of  V*c  on  x  (i.e., 
II  V*c  IIf  —  ^cc  ||  x  ||  Vx  with  ||  •  ||p.  the  Frobenius  matrix  norm),  and  M  is  a  positive 
scalar  constant  satisfying  M  ||  x  ||>||  G*(x)  -  G*(0)  ||  Vx  (the  boundedness  of  the  partial 
derivatives  of  G*  ensures  that  such  an  M  exists).  Define  z  =  [||  e3  ||  ||  e3  ||  ||  w  ||]T  and 
Qi  as 


Q  i 


Mi  _  hM. 

k\  ku 
0 

27 


0 

(7-&)A  m(H') 


M 

27 


-A  m{H*) 
hi 


and  note  that  Q\  is  positive-definite  if  7  is  chosen  large  enough.  Then  the  following  bound 
on  Vi  in  (7)  can  be  obtained  through  routine  manipulation: 


Vi<-(AmWi)--V11/2)|M|2  (8) 

7 

where  aj  is  a  scalar  constant  which  does  not  increase  as  7  is  increased.  Let  ci(t)  = 
Am(Qi)— &1V11  (t) / 1  ^d  choose  7  large  enough  so  that  cj(0)  >  0  (this  is  always  possible). 

Then  (8)  implies  that  ci(0)  <  cj(i)  Vi,  so  that  Vi  <  — cj(0)  ||  z  ||2.  Standard  arguments 
can  then  be  used  to  show  that  z  and  $  are  bounded  (and  therefore  that  e3,  e3,  w,  and  f 
are  bounded)  and  that  z  converges  to  zero  (and  therefore  that  es,  e3,  and  w  converge  to 
zero). 

We  consider  next  the  case  of  general  U(x).  Applying  the  adaptive  scheme  (4)  to  the 
mechanical  system  dynamics  (3)  yields  the  closed-loop  error  dynamics 


H*es  +  V*ce3  +  &!72w  +  k2i2e3  +  <&«*  =  0 


(9) 


where  §>,*  =  f  —  G*.  Define  the  Lyapunov  function  candidate 


v2  =  +  1^27 2efe,  + 


^wTH"e,  +  (10) 


and  note  that  V2  is  a  positive-definite  and  radially-unbounded  function  of  es,  e3,  w  and 
if  7  is  chosen  sufficiently  large.  Differentiating  (10)  along  (9)  and  simplifying  as  in  the 
first  part  of  the  proof  yields  ~~ 


y2<_(7__L)Am(ir)  ||  es 


.Mi 
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e*  ||2  ~hl  ||  w  ||2  ||  w  mi  ea 
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hk 
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Defining  Q2  as 


Q2  = 


kj  7 


*1 

0 

0 


0 


-A  U(H-) 

-A  m(H")  Ai7 

(and  noticing  that  Q2  is  positive-definite  if  7  is  chosen  large  enough)  permits  the  following 
bound  on  V2  in  (11)  to  be  obtained: 


V2  <  -ztQ2z  + 


7 


hhc 

hi 


u 

|2  |  “'Co 
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w 


-  jA m(H')  ||  e 


-£#2>f+G*) 


(12) 


If  fmax  is  selected  so  that  ||  G*(x)  ||<  fmax  Vx  then  the  following  bound  on  V2  in  (12) 
can  be  derived  through  routine  manipulation: 

%  <  -(A „((?,)  -  V: ?»)  ||  2  f  ||  f 

7  P 

where  a.2  and  tj2  are  scalar  constants  which  do  not  increase  as  7,  (3  increase. 

The  remainder  of  the  proof  will  utilize  the  ultimate  boundedness  result  presented  by 
the  authors  in  [23,24],  which  is  based  on  the  work  of  Corless  and  Leitmann  [31].  Toward 
this  end,  it  is  convenient  to  bound  V2  in  (10)  in  the  following  manner: 

A.  II  z  II2  II  IP<  V2  <  A,  ||  2  |p  ||  |p 

where  the  A,-  are  positive  scalar  constants  independent  of  /?.  Now  choose  two  scalar  con¬ 
stants  Vm,  Vm  so  that  Vm  >  Vm  >  T4(0),  and  define  cm  =  \m(Q2)  —  ^Vm1^2 /t,  then 
choose  7  large  enough  so  that  cm  >  0  (this  is  always  possible).  Let  8*  =  A3 772 /cm  + 
2max(2 f^ax ,rj2/a0)  and  choose  /30  so  that  8*  /  j30  <  Vm  (this  is  always  possible).  Then 
selecting  /3  >  /30  ensures  that  if  Vm  <V2<  Vm  then  V2  <  0.  This  condition  together  with 
Vm  >  Vm  >  ^2(0)  implies  that  V2(t)  <  Vm  Vi,  so  that  c2(t)  =  Am(Q2)  —  «2^1/,2(t)/7  > 
cm  >  0  Vi  and 


V2  <  -cM  z 


2(Ar+  A/3) 


(/?o  +  A/3) 


where  A/3  =  /3  —  /3q  and  it  is  assumed  that  (3  is  chosen  so  that  A(3  >  0.  It  follows 
immediately  that  ||  z  ||  and  ||  |(  are  uniformly  bounded.  Moreover,  the  ultimate 

boundedness  result  given  in  [23,24]  is  now  directly  applicable  and  permits  the  conclusion 
that  ||  z  ||  converges  asymptotically  to  a  dlo^ti  ball  of  radius  r2  =  8*/(X1(/3q  +  A/3)).  ■ 
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Several  observations  can  be  made  concerning  the  decentralized  adaptive  stabilization 
strategy  (4).  First,  the  preceding  stability  analysis  demonstrates  that  this  scheme  ensures 
that  the  mechanical  system  dynamics  (3)  evolves  from  any  initial  configuration  to  any  de¬ 
sired  (and  suitably  defined)  m-dimensional  equilibrium  manifold  with  uniformly  bounded 
signals  and  with  desirable  convergence  properties  for  the  output  error:  asymptotic  conver¬ 
gence  to  zero  if  the  system  potential  energy  is  a  constant  on  the  equilibrium  manifold  and 
convergence  to  an  arbitrarily  small  neighborhood  of  zero  in  the  general  case.  Note  that, 
in  the  general  case,  the  ultimate  size  of  the  regulation  error  can  be  reduced  as  desired 
simply  by  increasing  a  single  controller  parameter  (the  adaptation  gain  /?).  It  is  easy  to 
show  that  the  above  robustness  to  potential  energy  variation  on  Ne  implies  a  robustness  to 
any  bounded  external  disturbance;  this  generalization  to  the  case  of  external  disturbances 
is  analogous  to  that  given  in  [23,24]  for  holonomically  constrained  robotic  systems  and  is 
worked  out  in  detail  in  [29].  The  adaptive  stabilization  strategy  (4)  possesses  a  simple 
decentralized  structure,  utilizes  only  measurements  of  system  configuration,  and  does  not 
require  knowledge  of  the  structure  or  parameter  values  for  the  system  model  or  any  pay- 
load.  Thus  the  proposed  scheme  provides  a  computationally  efficient,  modular,  and  readily 
implementable  solution  to  the  equilibrium  manifold  stabilization  problem.  Observe  that 
the  stability  properties  established  in  Theorem  1  are  semiglobal  in  the  sense  that  the  re¬ 
gion  of  attraction  can  be  increased  arbitrarily  by  increasing  the  controller  gain  7.  It  is 
stressed  that  this  does  not  imply  that  7  must  be  chosen  to  be  overly  large;  indeed,  we  have 
found  that  excellent  performance  is  obtainable  with  this  gain  set  to  quite  modest  values. 
It  is  mentioned  that,  for  clarity  of  exposition,  the  calculations  required  in  the  proof  of 
the  Theorem  1  have  been  condensed  significantly  in  the  above  presentation;  the  interested 
reader  is  referred  to  the  report  [29]  for  the  details  of  all  of  these  calculations.  An  additional 
observation  is  that  the  scalar  controller  gains  k\,  &2,  7,  and  3  can  be  replaced  with  the 
appropriate  diagonal  matrices  for  increased  flexibility  and  performance;  scalar  gains  are 
used  above  for  simplicity  of  development. 

3.2  Implementation  Results 

The  adaptive  stabilization  scheme  described  above  is  now  applied  to  a  mobile  robot 
through  computer  simulation.  The  mobile  robot  chosen  for  this  simulation  study  is  the 
two  wheel  vehicle  shown  in  Figure  1  and  described  in  [14].  The  dynamic  model  (1)  has  the 
following  form  for  this  system: 

mx  =  Acos0  —  pi[Ti  +  T2)sin0  +  di(t) 

my  =  Asinfl  +  p\(T\  +  T2)cos0  +  ^(t)  (13) 

J«  =  P2(H-  3?)  20 
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0  =  icos  6  +  ysin# 

where  x,  y,  6  are  the  position  and  orientation  coordinates  for  the  (axle  of  the)  mobile  robot, 
m,  J  are  the  system  inertial  parameters,  pi,p2  are  kinematic  parameters,  T\:T2  are  the 
torques  provided  at  the  wheels,  and  d\ ,  d2  are  external  disturbances.  In  this  simulation, 
d\,  d2  are  modeled  as  the  sum  of  a  constant  (bias)  force  with  magnitude  equal  to  one  half 
the  maximum  undisturbed  control  force  and  a  zero-mean  Gaussian  signal  with  standard 
deviation  of  one  tenth  the  maximum  undisturbed  control  force.  Note  that  in  all  of  the 
simulations  presented  in  this  paper  the  unit  of  length  is  meter,  the  unit  of  time  is  second, 
and  the  unit  of  angle  is  degree,  unless  stated  otherwise. 

In  this  simulation  study,  the  vector  of  system  outputs  is  chosen  to  be  s  =  [y  0\ thus 
the  control  objective  is  to  drive  the  mobile  robot  from  any  initial  configuration  x(0)  = 
[x(0)  y(0)  0(O)]T  to  the  x-axis  in  the  presence  of  external  disturbances  and  with  no 

model  information  or  rate  measurements.  The  stabilization  scheme  (4)  is  applied  to  the 
mathematical  model  of  the  mobile  robot  described  above  and  shown  in  Figure  1  through 
computer  simulation  with  a  sampling  period  of  two  milliseconds.  All  integrations  required 
by  the  controller  are  implemented  using  a  simple  trapezoidal  integration  rule  with  a  time- 
step  of  two  milliseconds.  Additional  details  concerning  the  simulation  strategy  for  this 
study  can  be  found  in  [29].  The  system  model  parameters  are  defined  as  m  =  J  =  10  and 
Pi  =  P2  =  1,  and  the  controller  parameters  k2,j  are  set  as  follows:  ki  =  10,  k2  =  20, 
7  =  5.  The  adaptive  gain  f  is  set  to  zero  initially  and  the  adaptation  parameters  are  set 
as  follows:  ao  =  0.1,  /3  =  10,  and  /mai  =  100.  It  is  noted  that  no  attempt  was  made  to 
tune  the  gains  to  obtain  the  best  possible  performance.  The  stabilization  scheme  (4)  was 
tested  using  a  wide  range  of  initial  conditions.  Results  for  a  typical  simulation  are  given 
in  Figures  2a  and  2b,  with  Figure  2a  showing  the  response  of  the  y  coordinate  and  Figure 
2b  depicting  the  evolution  of  the  orientation  coordinate.  The  response  for  other  initial 
conditions  were  quite  similar  and  hence  are  not  shown.  Observe  that  accurate  and  robust 
equilibrium  manifold  stabilization  of  the  system  is  achieved.  We  mention  that  other  choices 
of  system  output  and  external  disturbances  were  also  tested  in  this  simulation  study  and 
that  similar  performance  was  obtained  in  all  cases. 

4.  Adaptive  Control  of  Differentially  Flat  Systems 

Having  developed  a  simple  and  robust  strategy  for  stabilizing  the  nonholonomic  me¬ 
chanical  system  (1)  (or  (3))  to  an  m-dimensional  equilibrium  manifold,  we  now  consider 
the  problem  of  stabilizing  the  system  to  an  equilibrium  point  xj.  As  mentioned  above,  it 
is  known  that  this  problem  is  not  solvable  using  continuous  static  state  feedback  [27,12]. 
However,  it  has  been  shown  in  [12]  that  ihSlsystem  (1)  is  strongly  accessible  and  small 
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time  locally  controllable  at  an  equilibrium  state  (x</,  0).  Thus  it  makes  sense  to  study 
the  equilibrium  point  stabilization  problem,  but  to  investigate  techniques  other  than  the 
simple  strategy  derived  in  the  previous  section.  In  this  section  and  the  next  we  focus  our 
attention  on  important  subclasses  of  the  nonholonomic  mechanical  system  (1),  and  develop 
strategies  for  controlling  these  systems  which  exploit  the  additional  structure  present  in 
these  subclasses.  In  each  case  our  approach  is  to  first  consider  the  problem  of  generating 
a  trajectory  for  the  system  which  will  take  the  system  to  the  goal  configuration,  and  then 
to  design  a  controller  capable  of  ensuring  that  the  system  accurately  tracks  this  trajectory 
in  the  presence  of  model  and  measurement  uncertainty. 

4.1  Background 

One  type  of  nonholonomic  system  for  which  trajectory  generation  is  particularly 
straightforward  is  the  class  of  so-called  differentially  flat  systems  [e.g.,  7,17].  Roughly 
speaking,  a  system  is  differentially  flat  if  there  is  a  set  of  outputs,  equal  in  number  to  the 
number  of  inputs,  such  that  all  states  and  inputs  can  be  determined  from  these  outputs 
without  integration.  More  precisely,  a  system  with  states  s  (E  3ft2n  and  inputs  u  6  is 
flat  if  we  can  find  outputs  y  €  of  the  form 

y  =  y(s,  u,  u,...,u(?)) 

such  that 

s  =  s(y,y,-,y(s))  (14a) 

u  =  u(y,y,...,y(s))  (146) 

Differentially  flat  systems  have  attracted  considerable  attention  recently  and,  while  there 
are  no  general  methods  for  determining  whether  or  not  a  particular  system  is  flat,  it  is 
known  that  many  systems  of  interest  in  applications  are  flat.  For  example,  all  of  the  follow¬ 
ing  axe  flat:  mobile  robots  and  cars,  a  car  pulling  N  (properly  hitched)  trailers,  hopping 
robots,  underwater  vehicles,  chains  of  planar  rigid  bodies,  and  planar  satellite/manipulator 
systems.  Differentially  flat  systems  were  originally  studied  in  the  context  of  differential 
algebra  [e.g.,  7]  and  more  recently  using  more  traditional  geometric  control  methods  [e.g., 
17].  While  differential  flatness  can  be  viewed  as  a  generalization  of  dynamic  feedback 
linearization,  it  is  in  some  ways  more  natural  to  view  flatness  in  terms  of  trajectory  gener¬ 
ation.  This  perspective  is  the  one  taken  here,  and  our  approach  shall  be  to  exploit  flatness 
to  solve  the  problem  of  finding  a  feasible  trajectory  from  the  initial  configuration  to  the 
final  desired  configuration  x^,  and  then  to  design  a  control  strategy  to  track  this  desired 
trajectory  in  the  presence  of  uncertainty.  2-22 


13 


4.2  Control  Scheme 

Consider  the  problem  of  causing  the  mechanical  system  (1)  with  m  nonholonomic  con¬ 
straints  and  r  =  n  —  m  inputs  to  evolve  from  its  initial  state  (x(0),x2(0))  to  a  specified 
goal  state  (x<f,  0)  during  some  time  interval  [0,  T].  Suppose  that  a  vector  of  flat  outputs 
y  €  9£r  has  been  identified.  Note  that,  from  (14a),  there  is  a  set  of  outputs  and  deriva¬ 
tives  of  outputs  (y(0),y(0),  ...,y^(0))  corresponding  to  the  initial  state  and  another  set 
(y(T),  y(T), ...,  y^(T))  corresponding  to  the  goal  state.  The  trajectory  generation  prob¬ 
lem  can  therefore  be  reduced  to  the  problem  of  finding  a  curve  in  the  flat  output  space 
with  the  required  initial  and  final  location,  slope,  curvature,  and  so  on.  This  curve  fitting 
problem  is  standard  and  can  be  solved  in  a  number  of  ways,  and  any  solution  to  this  sim¬ 
ple  problem  then  defines  a  feasible  trajectory  for  the  full  system  (1)  which  produces  the 
requisite  motion. 

Observe  that  we  now  have  a  means  of  defining  a  trajectory  for  the  r-dimensional  flat 
output  y  which  ensures  that  the  nonholonomic  system  evolves  as  desired,  and  we  also  have 
a  strategy  for  controlling  the  r-dimensional  configuration  vector  x2  associated  with  the 
“reduced”  system  (3a).  Suppose  that  a  matrix  R  could  be  found  for  which  x  =  i?(x)y.  In 
this  case  the  system  (1)  could  be  reformulated  in  such  a  way  that  the  flat  output  y  and 
the  “reducing”  output  x2  are  equal,  and  then  a  natural  and  direct  means  of  controlling  the 
system  would  be  to  first  specify  the  desired  trajectory  y <*(£)  for  the  flat  output  and  then 
generate  the  control  input  F  which  ensures  that  x2(=  y)  in  (3a)  tracks  this  trajectory. 
Unfortunately,  the  next  lemma  shows  that  the  fiat  and  reducing  outputs  do  not  coincide 
except  in  the  trivial  case  of  a  holonomically  constrained  system. 

Lemma  2:  If  the  system  (1)  admits  a  flat  output  y  and  a  reducing  output  x2,  then  y  =  x2 
only  if  the  system  constraints  are  integrable. 

Proof:  Suppose  that  x2  is  a  vector  of  outputs  which  is  both  reducing  and  flat.  Then  the 
configuration  coordinates  x  =  [x^  must  satisfy  Xi  =  i?i(xi,x2)x2  for  some  R\  6 

3Jmxr  and  also  Xi  =  g(x2, ...,  x^)  for  some  function  g  and  integer  s.  Direct  differentiation 
of  Xi  =  g(x2 , ...,  x^ )  shows  that  xx  depends  on  derivatives  of  x2  through  order  s  + 1,  while 
the  relationship  Xi  =  Ui(gi(x2,  ...,X2^),x2)x2  indicates  that  Xi  depends  on  derivatives  of 
x2  through  order  max(l,  s).  Taken  together  these  results  imply  that  the  only  way  x2  can 
be  both  reducing  and  fiat  is  if  s  =  0,  but  in  that  case  Xi  =  g(x2)  and  the  constraints  are 
integrable.  ■ 

Lemma  2  shows  that  for  nonholonomic  systems  of  the  form  (1),  the  flat  and  reducing 

outputs  cannot  coincide.  As  a  consquence,  it  is  not  possible  to  simply  generate  a  trajectory 

for  the  flat  outputs  which  produces  the  desired  motion  and  then  track  this  trajectory 
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using  the  reduced  model  (3a).  However,  it  is  possible  to  adopt  essentially  this  strategy  by 
adding  one  additional  step  in  the  algorithm.  Thus  we  propose  the  following  solution  to 
the  equilibrium  point  stabilization  problem  for  differentially  flat  systems: 

Algorithm  1: 

1.  Given  an  initial  state  (x(0),x2(0))  and  a  goal  state  (x,*,  0),  determine  the  sets  of  flat 
outputs  and  derivatives  of  outputs  (y(0),y(0),  ...,y(,)(0))  and  (y(T),y(T),...,y(a)(T)) 
corresponding  to  the  initial  and  goal  states.  Find  a  smooth  curve  y d(t)  in  the  flat 
output  space  connecting  these  initial  and  final  output  values. 

2.  Use  the  relation  x2d(t)  =  g(y  <*(<),  ...,y^(t))  to  determine  the  desired  trajectory  for 
x2.  Observe  that  this  trajectory  completely  specifies  the  velocity  of  the  nonholonomic 
system  (1)  at  each  instant  and  therefore  ensures  that  the  full  system  state  evolves  as 
desired. 

3.  Track  the  desired  trajectory  x2d(t)  for  the  configuration  x2  of  the  reduced  system  (3a) 
through  proper  specification  of  the  control  input  F. 

In  order  to  successfully  implement  the  proposed  approach  to  equilibrium  point  stabi¬ 
lization,  it  is  necessary  to  ensure  that  the  (reduced)  mechanical  system  (3a)  closely  tracks 
the  desired  trajectory  x2(*  (where  it  is  assumed  that  x2(*  is  bounded  with  bounded  deriva¬ 
tives).  Our  objective  is  to  develop  a  control  scheme  which  is  capable  of  accurate  tracking 
control  without  rate  measurements  or  knowledge  of  the  system  dynamic  model.  Consider 
the  decentralized  adaptive  tracking  controller  obtained  from  (4)  through  the  introduction 
of  feedforward  elements  in  the  control  law: 

Fi  -  a,i(t)x2di  +  bi(t)x2di  +  fi(t)  +  kij2Wi  +  k2'r2ei  (15) 

ib,  =  -2jWi  +  72ej  for  i  =  1, 2, ...,  r 

where  e  =  x2d  —  x2  and  the  ctj(t)  and  bi(t)  are  (feedforward)  adaptive  gains  which  are 
adjusted  according  to  the  following  simple  update  laws  (the  adaptation  laws  for  the  fi(t) 
are  repeated  here  for  convenience  of  reference): 

fi  =  —°1  fi  +  Pl9i 

hi  =  -o2a,i  +  fcqihdi^  (16) 

bi  -  -a3bi  +  03qiX2di  for  i  =  1,2,  ...,r 

where  q  =  e  +  k2e/ki-y  —  w/7  represents  a  weighted  and  filtered  error  term  and  the  aj  and 
are  positive  scalar  adaptation  gains.  Observe  that  the  scheme  (15), (16)  is  implementable 
without  velocity  information  because,  althodfh  w,  f,  a,  and  b  depend  on  e,  the  control 
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law  requires  the  terms  w,  f,  a,  and  b  and  these  can  be  integrated  so  as  to  depend  only  on 
e. 

The  stability  properties  of  the  proposed  decentralized  adaptive  tracking  controller 
(15), (16)  are  summarized  in  the  following  theorem. 

Theorem  2:  The  adaptive  controller  (15), (16)  ensures  that  e,e,w,  f,  a,  b  are  semiglob- 
ally  uniformly  bounded  provided  7  is  chosen  sufficiently  large.  Moreover,  the  trajectory 
tracking  error  e,  e  is  guaranteed  to  converge  exponentially  to  a  compact  set  which  can  be 
made  arbitrarily  small. 

Proof:  Applying  the  control  law  (15)  to  the  mechanical  system  (3a)  yields  the  closed-loop 
system  dynamics 

H*e  +  V*ce  -f  &i72w  +  k2 72e  +  <&/  +  [ax2<f]  +  [bx2d]  +  Vcde  =  0  (17) 

where  the  notation  [st]  =  [siti,s2t2,  ...,sntn]  6  (for  any  two  n-vectors  s  and  t) 

is  introduced,  Vcd  =  yc*(x,x2(f),  and  $/  =  f  —  G*  —  H*x2d  —  VcdX2d-  Note  that  in 
obtaining  (17)  the  Coriolis/centripetal  term  was  expanded  as  follows:  V*cx.2d  —  Vcdx2  = 
Vcdx2d  —  V^e.  Consider  the  Lyapunov  function  candidate 

Vs  =  l,2  +  _lara+_Lbrb  (18) 

where  it  is  understood  that  V2  is  now  a  function  of  e,  e,  <f>f  rather  then  ea,  e3,  &d-  Note 
that  Vz  is  a  positive- definite  and  radially-unbounded  function  of  e,  e,  w,  <&/,  a,  and  b  if 
7  is  chosen  sufficiently  large.  Computing  the  derivative  of  (18)  along  (17)  and  simplifying 
using  the  adaptation  laws  (16)  yields 

^3  =  -(7  -  -j^)eTH*e  -  &  ||  e  ||2  -*l7  ||  w  ||2  +2wTH*e 

+  ieT[^-yc*e  -  Fc>]  -  qTVcde  +  —#£(*/  -  ftq) 

+  ^-aT(a  ~  fofoxid])  +  ^-bT(b  -  fo[<&2d]) 

<  ~((f  -  -  kcd)  ||  e  ||2  -$1  ||  e  ||2  -kl7  ||  w  ||2 

+  (2A M(^*)  +  —  +  *£H=“)  |fwl|||  e  ||  +-£—(kccvmax  +  kcd)  ||  e  ||||  e  || 

7  7  «i7 

+  II  e  nil  e  II2  II  w  nil  e  ||2  -g  ||  a  ||2  -g  ||  b  ||2 

+  (19) 
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where  kcd  is  an  upper  bound  on  Vcd  (i.e.,  ||  Vcd  ||F  <  kcd  Vx)  and  vmax  is  an  upper  bound 
on  ||  X2 d  ||-  Manipulations  similar  to  those  used  in  the  proof  of  Theorem  1  then  permit 
the  following  upper  bound  on  V3  in  (19)  to  be  established: 


V,  -  “((2  -  -  ka)  11  ®  l|2  “TT  11  e  1,2  11  w  1,2 

+  (2\M(ff‘)  +  hi  +  ||  w  mi  e  || 

7  7 

+  +  hi)  II  e  mi  e  II  II  e  II II  e  ||2  ||  w  ||||  e  |[2 

"'“I  I' 3 'I' "I  11  b  I!' +|  W 


where  773  is  a  positive  scalar  constant. 

Let  z  =  [||  e  ||  ||  e  ||  ||  w  ||]3',  *  =  [||  ||  ||  a  ||  ||  b  \\]T,  0min  =  min  (ft), 

ftmax  =  max(/3j),  and 


Q  = 


hi  _ v  +  k  A 

k1  2k\‘i''n'ccUrnax  '  ^cd) 

ifcChcVn.,  +  hi)  (J  -  &)A m(S~)  -  hi 
0  -Ajf(ir)  - 


0 

A M(#*)-^ 
hi 


and  note  that  Q  is  positive-definite  if  7  is  chosen  large  enough.  If  (3max  /  Pmin  is  fixed  then 
there  exist  positive  scalar  constants  774,  775  that  do  not  increase  as  7  and  Pmin  increase, 
and  positive  scalar  constants  A,  independent  of  7  and  Pmin,  such  that  14  and  V3  in  (18) 
and  (20)  can  be  bounded  as 

A.  II  z  ||2  +A-  ||  9  ||2<  <  A,  ||  z  ||2  +A-  ||  *  ||2 

Pmin  Pmin 

Vz  <  -(MQ)  -  ^Vz112)  II  z  II2  -A-  II  *  II2  +J0- 

I  Pmin  Pmin 


Now  choose  two  scalar  constants  Vm,  Vm  so  that  Vm  >  Vm  >  14(0),  and  define  cm  = 
Am(Q)  ~  r}4VM1/2/7;  then  choose  7  large  enough  so  that  cm  >  0  (this  is  always  possible). 
Let  S  —  max( A3 /c^f,  A4/A5)  and  choose  P0  so  that  r)5S/P0  <  Vm  (this  is  always  possible). 
Then  selecting  Pmin  >  Po  ensures  that  if  Vm  <  14  <  Vm  then  V3  <  0.  This  condition 
together  with  Vm  >  Vm  >  14(0)  implies  that  14(<)  <  Vm  Vt,  so  that  c(t)  =  Am(Q)  — 
r?4V31/2(t)/j  >  CM  >  0  Vf  and  — ■  ~ 


14  <  —cm  II  z 


(Po  +  A/?) 


V5 


(Po  +  A/?) 


where  A/3  =  (3mtn  —  Po  and  it  is  assumed  that  Pmin  is  chosen  so  that  A/3  >  0.  An  analysis 
identical  to  the  one  used  in  the  proof  of  TheoJfem  1  now  applies  and  permits  the  conclusion 
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that  ||  z  ||,  ||  ||  are  uniformly  bounded,  which  implies  that  e,  e,  w,  f,  a,  and  b  are 

uniformly  bounded.  Moreover,  j|  z  ||,  j|  ||  converge  exponentially  to  the  closed  balls  Bri , 
respectively,  where 


\^i(A)  +  A/?)/ 


Observe  that  the  radius  of  the  ball  to  which  |[  z  ||2  =  j|  e  ||2  +  ||  e  ||2  +  ||  w  ||2  is  guaranteed 
to  converge  can  be  decreased  as  desired  simply  by  increasing  A/3 .  ■ 

Several  observations  can  be  made  concerning  the  proposed  strategy  for  equilibrium 
point  stabilization  of  differentially  flat  nonholonomic  mechanical  systems.  First,  this 
scheme  provides  a  simple  and  computationally  efficient  means  of  ensuring  that  the  sys¬ 
tem  (1)  evolves  from  any  initial  configuration  to  any  desired  goal  configuration  x^.  The 
simplicity  and  efficiency  is  a  consequence  of  that  of  the  trajectory  generation  algorithm  for 
the  fiat  outputs  and  the  adaptive  tracking  controller  for  the  reducing  outputs.  Trajectory 
generation  can  be  achieved  using  standard  curve  fitting  techniques  once  the  flat  outputs 
are  identified  (see,  for  example,  [7]  for  a  discussion  of  this  problem),  and  the  tracking 
scheme  (15), (16)  possesses  a  simple  decentralized  structure,  utilizes  only  measurements 
of  system  configuration,  and  requires  virtually  no  information  concerning  the  system  dy¬ 
namics.  Thus  the  proposed  control  scheme  provides  an  efficient,  modular,  and  readily 
implementable  approach  to  equilibrium  point  stabilization.  It  is  shown  that  the  tracking 
controller  ensures  uniform  boundedness  of  signals  and  that  the  ultimate  bounds  on  the 
trajectory  tracking  errors  can  be  reduced  arbitrarily  by  increasing  the  adaptation  gains  /3j] 
notice  that  exponential  convergence  ensures  that  the  transient  behavior  of  the  system  will 
be  well-behaved.  It  is  mentioned  that,  for  clarity  of  exposition,  the  calculations  required  in 
the  proof  of  Theorem  2  have  been  condensed  significantly  in  the  above  presentation;  these 
details  are  given  in  the  report  [29].  An  additional  observation  is  that  the  scalar  controller 
gains  iq,  &2,  7,  and  f3  can  be  replaced  with  the  appropriate  diagonal  matrices  for  increased 
flexibility  and  performance;  scalar  gains  are  used  above  for  simplicity  of  development.  Fi- 
nally,  it  is  easy  to  see  that  the  solution  to  t&e  equilibrium  point  stabilization  problem  given 
in  Algorithm  1  also  provides  a  solution  to  a  broad  class  of  trajectory  tracking  problems 
for  fiat  systems:  given  any  feasible  trajectory  for  the  system  the  corresponding  flat  output 
trajectory  y<j(f)  can  be  determined,  and  this  trajectory  can  then  be  tracked  using  (steps  2 
and  3  of)  Algorithm  1. 
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4.3  Implementation  Results 

The  decentralized  adaptive  equilibrium  point  stabilization  scheme  described  above  is 
now  applied  through  computer  simulation  to  the  two  wheel  mobile  robot  shown  in  Figure 
1  and  described  in  (13).  This  system  is  differentially  flat  with  flat  outputs  y  =  [a:  y]T 
and  reducing  outputs  X2  =  [ y  9]T.  The  simulation  study  consists  of  two  parts:  a  system 
stabilization  study  and  a  trajectory  tracking  investigation.  The  first  simulation  illustrates 
the  capability  of  Algorithm  1  to  stabilize  the  system  to  the  origin  from  various  initial 
configurations  in  the  presence  of  external  disturbances  and  with  no  model  information  or 
rate  measurements.  The  trajectory  generation  phase  of  Algorithm  1  is  accomplished  by 
fitting  a  fourth  order  polynomial  for  each  flat  output  coordinate  to  the  given  initial  and  goal 
configurations  (in  all  cases  the  system  is  initially  at  rest).  This  output  curve  is  then  used 
to  specify  a  desired  trajectory  for  the  reducing  output  X2d(t),  and  the  adaptive  tracking 
scheme  (15), (16)  is  used  to  track  this  trajectory.  The  stabilization  algorithm  is  applied  to 
the  mathematical  model  of  the  mobile  robot  described  above  and  shown  in  Figure  1  through 
computer  simulation  with  a  sampling  period  of  two  milliseconds.  All  integrations  required 
by  the  controller  axe  implemented  using  a  simple  trapezoidal  integration  rule  with  a  time- 
step  of  two  milliseconds.  Additional  details  concerning  the  simulation  strategy  for  this 
study  can  be  found  in  [29].  The  system  model  parameters  and  disturbances  are  defined  as 
in  the  simulation  described  in  Section  3.2,  and  the  adaptive  controller  parameters  k\,  k 2, 7 
are  also  set  to  the  values  used  in  that  simulation.  The  adaptive  gains  f,  a,  b  axe  set 
to  zero  initially  and  the  adaptation  parameters  are  set  as  follows:  oy  =  0.1,  f3j  =  10 
for  i  —  1, 2, 3.  Note  that  no  attempt  was  made  to  “tune”  the  gains  to  obtain  the  best 
possible  performance.  The  stabilization  scheme  given  in  Algorithm  1  was  tested  using  a 
wide  range  of  initial  conditions;  sample  results  are  given  in  Figure  3,  and  indicate  that 
accurate  equilibrium  point  stabilization  of  the  system  is  achieved. 

The  next  simulation  demonstrates  the  capability  of  Algorithm  1  to  provide  accurate 
trajectory  tracking  control  of  the  mobile  robot  shown  in  Figure  1  and  described  in  (13). 
Additionally,  this  simulation  indicates  that  a  simple  collision  avoidance  capability  can  be 
naturally  included  with  this  approach  to  mobile  robot  control.  The  nominal  task  in  this 
simulation  is  to  track  a  straight  fine  trajectory  in  flat  output  space  from  y initial  =  [0  3]T 
to  y final  =  [8  3]T.  However,  two  large  rectangular  obstacles  are  placed  in  the  robot 
workspace  as  shown  in  Figure  4,  thus  preventing  completion  of  the  nominal  task.  In  order 
to  provide  the  robot  with  a  simple  collision  avoidance  capability,  Algorithm  1  is  modified 
as  follows:  if  an  obstacle  is  encountered  during  the  task,  the  nominal  path  is  altered 
in  such  a  way  that  the  robot  tracks  along  the  obstacle  until  a  configuration  is  reached 
which  permits  a  smooth  output  trajectory  connect  this  configuration  to  the  original 
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goal  configuration  (see  Figure  4).  The  collision  avoidance  strategy  is  essentially  “local”  in 
character  and  provides  a  setting  within  which  both  model-based  and  sensor-based  strategies 
can  be  implemented.  The  scheme  is  based  on  the  robotic  manipulator  collision  avoidance 
work  reported  in  [32],  and  is  described  in  more  detail  in  [29],  The  focus  here  is  on  the 
role  of  Algorithm  1  as  a  trajectory  tracking  controller  for  flat  nonholonomic  systems;  a 
detailed  discussion  of  the  collision  avoidance  method  is  beyond  the  scope  of  the  paper  and 
thus  is  not  provided.  Algorithm  1  is  implemented  exactly  as  in  the  first  simulation  in 
this  section,  with  the  exception  of  the  collision  avoidance  modification  described  above. 
Sample  results  are  given  in  Figure  4  and  indicate  that  accurate  trajectory  tracking  (and 
successful  collision  avoidance)  is  achieved  with  the  proposed  strategy. 

5.  Adaptive  Control  of  Caplygin  Systems 

While  differential  flatness  appears  to  be  a  useful  system  property  to  exploit  when 
solving  the  trajectory  generation  problem  for  (flat)  nonholonomic  systems,  there  are  at 
present  no  general  methods  for  determining  whether  or  not  a  given  system  is  flat  or,  if 
a  system  is  flat,  what  the  flat  outputs  might  be.  Thus  it  is  desirable  to  identify  other 
structural  properties  of  nonholonomic  systems  that  can  be  used  for  control  purposes,  and 
to  develop  strategies  for  controlling  systems  which  possess  this  structure.  In  this  section  we 
focus  our  attention  on  so-called  Caplygin  systems  and  present  an  algorithm  for  stabilizing 
these  systems  to  an  equilibrium  configuration  in  the  presence  of  uncertainty. 

5.1  Background 

Consider  the  system  (1)  with  configuration  coordinates  x  =  [x^  and  suppose 

that  the  Lagrangian  (of  the  free  system)  £(x,  x)  =  xr#(x)x/2  —  Z7(x)  and  the  input  and 
constraint  matrices  Af,  A  are  independent  of  Xj.  In  this  case  the  nonholonomic  system  (1) 
becomes 


M(x2)T  =  tf(x2)x  +  Vcc(x2,x)x  +  G(x2)  +  AT(x2)\ 

A{x2)±  =  0  (21) 

and  is  known  as  a  (controlled)  Caplygin  system  [12].  This  functional  form  for  the  system 
dynamics  is  an  expression  of  symmetries  in  the  problem,  and  such  symmetries  occur  nat¬ 
urally  in  many  physical  systems  of  interest  in  applications  [12].  For  the  Caplygin  system 
(21),  the  reduced  system  (3)  takes  the  form 

F  =  tf*(x2)x2  +  Vc*(x2,x2)x2  +  G*(x2) 

Xi=A*(x2)x2  2-29 
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(22a) 
(22  b) 


Observe  that  (22a)  is  a  2r  order  differential  equation  which  defines  the  evolution  of  the  2r 
states  (x2,X2).  As  a  consequence,  X2  constitutes  a  vector  of  coordinates  for  the  reduced 
configuration  space  for  the  system  (22a);  this  space  is  often  referred  to  as  the  “base  space” 
of  the  complete  system  (22)  [12,17]. 

The  evolution  of  the  base  space  coordinates  can  be  controlled  by  properly  specifying  the 
input  F.  Indeed,  it  is  easy  to  see  that  the  adaptive  scheme  (15), (16)  can  be  used  to  cause  X2 
to  track  any  desired  trajectory  X2 d  in  the  presence  of  model  and  measurement  uncertainty. 
Given  a  trajectory  for  X2 ,  the  behavior  of  the  remaining  configuration  coordinates  Xi  is  then 
completely  determined  by  the  kinematic  relationship  (22b).  This  observation  is  used  in 
[12]  to  develop  an  algorithm  for  stabilizing  the  full  system  configuration  x.  The  algorithm 
proposed  in  [12]  is  based  on  the  use  of  geometric  phase,  which  is  the  extent  to  which  a 
closed  path  in  the  base  space  fails  to  be  closed  in  the  configuration  space,  to  maneuver 
both  Xi  and  X2  to  their  desired  values.  In  what  follows  we  present  a  version  of  this  control 
algorithm  which  is  implementable  without  model  information  or  rate  measurements. 

5.2  Control  Scheme 

Consider  the  problem  of  transferring  the  Caplygin  system  (22)  from  any  initial  state 
to  an  equilibrium  point,  and  suppose  for  simplicity  of  notation  that  the  goal  configuration 
is  the  origin.  Let  (x°,x®,x®)  denote  the  initial  state,  and  consider  the  following  algorithm 
for  stabilizing  (22)  to  the  origin. 

Algorithm  2: 

1.  Given  an  initial  state  (x®,X2,x®),  drive  the  system  (22a)  to  the  origin  in  the  (x2,x2) 
base  state  space  in  finite  time  using  the  trajectory  tracking  scheme  (15), (16).  Observe 
that  this  can  be  accomplished  using  any  smooth  path  between  (x®,x®)  and  (0,0)  and 
that,  in  general,  the  resulting  state  of  the  system  will  be  (x[,  0, 0)  for  some  nonzero 

xi* 

2.  Find  a  closed  path  (or  series  of  closed  paths)  in  the  base  space  which  produces  the 
desired  geometric  phase  in  the  configuration  space,  so  that  (xj,0, 0)  is  transfered  to 
(0,0,0). 

3.  Track  the  desired  base  space  trajectory  X2 d(t)  using  the  controller  (15), (16). 

In  order  to  successfully  implement  this  approach  to  stabilizing  (22),  we  must  determine  a 
closed  path  V  in  the  base  space  which  satisfies  the  geometric  phase  condition 

x\=-  f  A*(x2)dx2  (23) 

JV 

Explicit  characterization  of  the  closed  path -T^Kvhich  satisfies  (23)  can  be  achieved  for  sev- 
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era!  systems  of  practical  interest;  see,  for  instance,  [12,17]  and  the  simulations  presented 
below.  However,  some  systems  may  require  a  more  general  computational  approach.  For 
example,  an  algorithm  for  approximately  characterizing  the  closed  path  based  on  Lie  al¬ 
gebraic  methods  is  given  in  [33].  Alternatively,  numerical  integration  can  be  utilized  to 
iteratively  determine  a  closed  path  with  the  requisite  geometric  phase  [17].  In  any  case, 
once  this  closed  path  in  the  base  space  is  found  it  is  clear  that  Algorithm  2  will  produce 
the  desired  stabilization  of  (22). 

It  is  interesting  to  note  that  Algorithm  2  is  actually  applicable  to  a  class  of  non- 
holonomic  systems  which  is  more  general  than  the  class  of  Caplygin  systems  (21).  To 
see  this,  observe  that  this  approach  to  system  stabilization  can  be  implemented  with  the 
nonholonomic  system 


F  =  iP(x)x2  +  Vc*c(x,x2)x2  +  G*(x)  (24a) 

xi  =  A*(x2)x2  (246) 

in  which  only  the  constraint  matrix  A  is  independent  of  Xi  and  the  Lagrangian  and  input 
matrix  are  functions  of  the  full  configuration  x.  This  fact  follows  directly  from  the  proof 
of  Theorem  2,  where  it  is  shown  that  the  tracking  scheme  (15), (16)  is  appropriate  for  the 
system  (24a),  and  the  fact  that  the  evolution  of  Xi  is  still  completely  determined  by  the 
kinematic  relationship  (24b). 

5.3  Implementation  Results 

The  adaptive  equilibrium  point  stabilization  scheme  described  in  Algorithm  2  is  now 
applied  through  computer  simulation  to  two  different  nonholonomic  mechanical  systems: 
the  two  wheel  mobile  robot  shown  in  Figure  1  and  discussed  above,  and  the  “planar  skater” 
depicted  in  Figure  5  and  described  in  [34].  The  first  simulation  involves  the  mobile  robot. 
Observe  that  the  change  of  coordinates  z\  =  rcos(0  +  7r/2)  +  ysin(#  +  7r/2),  z?  =  0  - f  7r/2, 
zz  =  —  xsin(0  +  7t/2)  +  ycos(0  +  7r/2)  permits  the  model  (13)  to  be  rewritten  [12] 

mzi  =  —mz\z\  +Pi(Ti  +  T2)  +  Pzz3{Ti  —  T2)  +  d\(t)  (25a) 

Jh  =p2(Ti  -T2)  +  d2(t)  (256) 

zz  =  —Z\  z2  __  (25c) 

where  the  pt-  are  positive  constants  and  all  other  terms  are  as  defined  in  (13).  It  can  be 
seen  that  this  system  is  Caplygin  with  base  coordinates  (zi,  z2).  The  dynamics  for  the  base 
space  is  defined  by  (25a), (25b)  and  the  evolution  of  z3  is  determined  through  the  simple 
kinematic  relation  (25c).  Note  that  the  kinMnatics  is  simple  enough  to  be  integrated  in 
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closed  form  in  this  case,  and  that  the  resulting  geometric  phase  condition  z\  =  Z\Z2  makes 
the  problem  of  finding  a  closed  path  V  in  base  coordinates  which  stabilizes  23  particularly 
easy. 

The  control  objective  in  this  simulation  is  to  stabilize  the  system  to  the  origin  in  the 
presence  of  external  disturbances  and  with  no  model  information  or  rate  measurements. 
This  objective  is  achieved  using  the  three  step  procedure  given  in  Algorithm  2.  In  the 
first  step,  the  initial  base  configuration  (zj,z£)  is  driven  to  (0,0)  along  a  straightline 
path  using  the  tracking  controller  (15), (16).  Then,  given  the  resulting  value  for  z\ ,  a 
rectangular  closed  path  V  of  proper  width  and  length  is  defined  in  the  base  space;  this 
path  is  then  tracked  with  the  adaptive  controller  (15), (16).  The  stabilization  algorithm  is 
applied  to  the  mathematical  model  of  the  mobile  robot  described  above  and  shown  in  Figure 
1  through  computer  simulation  with  a  sampling  period  of  two  milliseconds.  All  integrations 
required  by  the  controller  are  implemented  using  a  simple  trapezoidal  integration  rule  with 
a  time-step  of  two  milliseconds.  Additional  details  concerning  the  simulation  strategy  for 
this  study  can  be  found  in  [29].  The  system  model  parameters  and  disturbances  are 
defined  as  in  the  simulation  presented  in  Section  3.2,  and  the  adaptive  controller  (15), (16) 
is  implemented  exactly  as  described  in  Section  4.3.  The  stabilization  scheme  given  in 
Algorithm  2  was  tested  using  a  wide  range  of  initial  conditions  and  external  disturbances. 
Sample  results  are  given  in  Figure  6a-6d,  and  indicate  that  accurate  equilibrium  point 
stabilization  of  the  mobile  robot  is  achieved. 

The  next  simulation  demonstrates  the  capability  of  Algorithm  2  to  stabilize  the  orien¬ 
tation  of  the  planar  skater  shown  in  Figure  5  and  described  in  [34].  This  system  represents 
a  simple  model  of  mechanical  systems  for  which  angular  momentum  conservation  induces 
a  nonholonomic  constraint  on  the  system  dynamics.  Let  the  configuration  of  the  system 
be  defined  by  x  =  [0  <pj  (f> 2]^,  where  6  is  the  absolute  angle  of  the  center  body  with 
respect  to  some  fixed  coordinate  system  and  4>\ ,  02  are  the  angles  of  the  two  arms  of  the 
system  relative  to  the  center  body.  The  dynamic  model  for  this  system  is  presented  in 
detail  in  [34],  and  therefore  the  specifics  are  not  repeated  here.  Instead,  we  simply  note 
that  the  system  is  Caplygin,  has  base  coordinates  X2  =  [©i  ^2]^  which  evolve  according  to 
a  dynamic  model  of  the  general  form  (22a),  and  is  nonholonomically  constrained  through 
conservation  of  angular  momentum.  If  it  assumed  that  the  system  has  zero  momentum 
initially  then  this  constraint  defines  the  evolution  of  6  through  a  kinematic  equation  of 
the  form  (22b).  Thus  Algorithm  2  is  directly  applicable  to  stabilizing  the  system  to  an 
equilibrium  configuration. 

The  control  objective  in  this  simulation  is  to  stabilize  the  planar  skater  to  the  origin  in 
configuration  space  in  the  presence  of  extejjn^disturbances  and  with  no  model  information 
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or  rate  measurements.  This  objective  is  achieved  using  the  three  step  procedure  given 
in  Algorithm  2.  This  stabilization  algorithm  is  implemented  exactly  as  it  was  with  the 
mobile  robot  in  the  first  part  of  this  section,  with  the  exception  of  the  determination  of 
a  closed  path  V  in  base  coordinates  which  stabilizes  6.  In  the  present  case  the  geometric 
phase  condition  (23)  could  not  be  integrated  in  closed  form,  so  that  the  calculation  of  an 
appropriate  path  V  given  the  value  x\  =  61  was  performed  numerically.  The  stabilization 
scheme  was  tested  using  a  wide  range  of  initial  conditions.  Sample  results  are  given  in 
Figure  7a- 7f,  and  indicate  that  accurate  equilibrium  point  stabilization  of  the  planar  skater 
is  achieved.  Figure  7a  depicts  the  initial  configuration  of  the  skater  and  Figure  7b  shows 
the  (user  defined)  “zero”  configuration  for  the  base  coordinates;  observe  that  in  driving 
the  system  to  the  origin  in  base  space  the  6  coordinate  does  not  reach  zero.  Figures  7c-7f 
then  shows  a  closed  path  in  base  space  which  causes  the  6  coordinate  to  be  transfered  from 
its  value  in  Figure  7b  (approximately  45°)  to  its  desired  final  value  of  0°. 

6.  Conclusions 

This  paper  considers  the  problem  of  controlling  nonholonomic  mechanical  systems 
in  the  presence  of  incomplete  information  concerning  the  system  model  and  state,  and 
presents  a  new  class  of  decentralized  adaptive  controllers  as  a  solution  to  this  problem.  The 
proposed  control  strategies  provide  simple  and  robust  solutions  to  a  number  of  important 
nonholonomic  system  control  problems,  including  stabilization  of  general  systems  to  an 
equilibrium  manifold  and  stabilization  of  differentially  fiat  and  Caplygin  systems  to  an 
equilibrium  point.  Additionally,  it  is  indicated  how  the  proposed  approach  to  equilibrium 
point  stabilization  of  flat  systems  can  be  applied  to  trajectory  tracking  control.  All  of 
the  schemes  possess  a  simple  decentralized  structure,  are  computationally  efficient  and 
implementable  without  system  model  or  rate  information,  and  ensure  uniform  boundedness 
of  all  signals  and  accurate  motion  control.  Computer  simulation  results  are  provided 
which  demonstrate  stabilization  of  a  mobile  robot,  stabilization  of  a  planar  skater,  and 
trajectory  tracking  control  with  simple  collision  avoidance  for  a  mobile  robot.  Future 
work  will  focus  on  the  development  of  adaptive  equilibrium  point  stabilization  schemes  for 
additional  classes  of  nonholonomic  mechanical  systems. 
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Figure  1:  Graphical  Representation  of  Mobile  Vehicle 


Figure  2a:  Response  of  output  coordinate  y  in  equilibrium  manifold  stabilization 
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Figure  6a:  Response  of  base  coordinate  z\  in  mobile  robot  simulation 
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Figure  6b:  Response  of  base  coordinate mobile  robot  simulation 


Figure  7 :  Evolution  of  skater  configuration  in  equilibrium  point  stabilization  simulation 
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ELECTRICALLY-DRIVEN  MANIPULATORS 
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Abstract 

This  paper  presents  two  adaptive  schemes  for  compliant  motion  control  of  uncertain 
rigid-link,  electrically-driven  manipulators.  The  first  strategy  is  developed  using  an  adap¬ 
tive  impedance  control  approach  and  is  appropriate  for  tasks  in  which  the  dynamic  character 
of  the  end-effector /environment  interaction  must  be  controlled,  while  the  second  scheme 
is  an  adaptive  position/force  controller  and  is  useful  for  those  applications  that  require  in¬ 
dependent  control  of  end-effector  position  and  contact  force.  The  proposed  controllers  are 
very  general  and  computationally  efficient,  and  can  be  implemented  with  virtually  no  a  pri¬ 
ori  information  concerning  the  manipulator/ actuator  dynamic  model  or  the  environment 
It  is  shown  that  the  schemes  ensure  (semiglobal)  uniform  boundedness  of  all  signals,  and 
that  the  ultimate  size  of  the  system  errors  can  be  made  arbitrarily  small.  The  capabilities 
of  the  proposed  control  strategies  are  illustrated  through  both  computer  simulations  and 
laboratory  experiments  with  a  six  degree-of-freedom  (DOF)  IMI  Zebra  Zero  manipulator. 

1.  Introduction 

The  problem  of  controlling  robotic  manipulators  that  come  into  contact  with  objects 
in  their  workspace  is  of  central  importance  in  many  applications.  Manipulators  are  subject 
to  interaction  forces  whenever  they  perform  tasks  involving  motion  which  is  constrained 
by  the  environment,  such  as  assembling  parts  or  executing  grinding  or  finishing  opera¬ 
tions.  In  such  motions,  the  interaction  forces  must  be  accommodated  rather  than  resisted, 
so  as  to  comply  with  the  environmental  constraints.  Recognizing  the  importance  of  such 
compliant  motion,  many  researchers  have  studied  this  problem  for  rigid-link  manipulators. 
This  intensive  investigation  has  produced  two  basic  approaches  to  achieving  compliant 
motion:  position/force  control  [e.g.,  1]  and  impedance  control  [e.g.,  2].  The  first  approach 
is  based  on  the  observation  that  when  the  manipulator  end-effector  is  in  contact  with  the 
environment,  the  space  of  the  end-effector  coordinates  can  be  naturally  decomposed  into 
a  “position  subspace”  and  a  “force  subspace”;  these  two  subspaces  correspond  to  the  di¬ 
rections  in  which  the  end-effector  is,  resp&tf?ely,  able  to  move  and  able  to  exert  forces 
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on  the  environment.  The  position/force  control  approach  to  compliant  motion  is  then  to 
define  and  track  a  position  (and  orientation)  trajectory  in  the  position  subspace  and  a 
force  (and  moment)  trajectory  in  the  force  subspace.  Alternatively,  the  impedance  control 
approach  proposes  that  the  control  objective  should  be  the  regulation  of  the  mechanical 
impedance  of  the  robot  end-effector.  Thus  the  objective  of  the  impedance  controller  is  to 
maintain  a  desired  dynamic  relationship  between  the  end-effector  position  and  the  end- 
effect  or/ environment  contact  force.  It  is  noted  that  each  of  the  approaches  to  compliant 
motion  control  summarized  above  has  desirable  properties  and  is  well-suited  to  particular 
robotic  tasks.  For  example,  position/force  control  permits  the  end-effector  position  and 
contact  force  to  be  explicitly  and  independently  controlled,  which  is  necessary  in  applica¬ 
tions  such  as  contact-based  surface  inspection  or  remediation  of  hazardous  and  unstruc¬ 
tured  environments.  Alternatively,  the  impedance  control  formulation  provides  a  unified 
framework  for  considering  both  unconstrained  and  constrained  motion  control  problems 
and  for  accommodating  transition  between  these  two  operating  modes;  as  a  consequence 
impedance  control  has  been  found  useful  for  tasks  such  as  deburring  of  machined  parts  or 
mechanical  assembly. 

During  the  past  decade,  there  has  been  considerable  interest  in  designing  compliant 
motion  control  strategies  for  rigid-link  manipulators  which  can  perform  well  despite  incom¬ 
plete  information  concerning  the  system  model.  Most  of  the  controllers  proposed  thusfar 
have  been  developed  by  using  either  adaptive  control  methods  or  robust  control  techniques 
to  compensate  for  uncertainty  in  the  manipulator  model.  It  is  noted  that  adaptive  control 
methods  in  particular  have  been  shown  to  provide  an  attractive  means  of  handling  the 
model  uncertainty.  Included  among  the  advantages  of  this  approach  is  a  reduced  depen¬ 
dence  upon  a  priori  model  information,  the  capacity  to  respond  quickly  to  unexpected 
events,  and  the  potential  to  continuously  improve  performance  while  the  manipulator  is 
in  operation.  An  important  limitation  of  the  controllers  summarized  above  is  that  these 
schemes  have  been  designed  by  neglecting  the  actuator  dynamics  and  assuming  that  the 
joint  torques  can  be  commanded  directly.  However,  as  demonstrated  in  [e.g.,  3-5],  the 
actuator  dynamics  constitute  an  important  component  of  the  complete  manipulator  dy¬ 
namic  model.  Recognizing  the  potential  difficulties  associated  with  ignoring  the  effects  of 
the  actuator  dynamics,  several  researchers  have  recently  considered  the  problem  of  con¬ 
trolling  the  motion  of  rigid-link,  electrically- driven  (RLED)  manipulators  [5-16].  In  much 
of  this  work,  the  controller  development  requires  full  knowledge  of  the  complex  dynamic 
models  for  the  manipulator  and  actuators  [e.g.,  5-9].  Research  in  which  controllers  are 
designed  with  the  capability  to  compensate  for  uncertainty  in  the  manipulator/actuator 
system  includes  work  on  robust  control  stfatggies  [10,11]  and  adaptive  schemes  [12-16]. 
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It  is  noted  that  implementation  of  most  of  these  robust  and  adaptive  controllers  requires 
the  calculation  of  complex  manipulator-specific  quantities,  such  as  the  time-derivative  of 
the  manipulator  “regressor”  matrix  or  upper  bounds  on  the  derivatives  of  the  “fictitious 
controls”;  this,  in  turn,  limits  the  generality  and  applicability  of  these  strategies.  Addi¬ 
tionally,  most  of  this  research  has  focused  on  the  (unconstrained)  motion  control  problem, 
despite  the  fact  that  the  effects  of  the  actuator  dynamics  is  expected  to  very  important 
in  constrained  motion  applications.  Some  recent  progress  on  compliant  motion  control  of 
RLED  manipulators  is  reported  in  [17]. 

This  paper  considers  the  compliant  motion  control  problem  for  -uncertain  RLED  ma¬ 
nipulators,  and  presents  two  new  adaptive  schemes  as  solutions  to  this  problem.  The 
proposed  controllers  possess  a  simple  and  modular  structure,  are  easy  to  implement,  and 
requires  virtually  no  information  regarding  the  manipulator/ actuator  system  model.  The 
first  scheme  is  an  impedance  controller  which  adaptively  generates  the  joint  torques  re¬ 
quired  to  provide  the  desired  dynamic  relationship  between  end-effector  position  and  con¬ 
tact  force,  while  the  second  strategy  is  an  adaptive  position/force  controller  which  ensures 
that  the  desired  end-effector  position  trajectory /force  setpoint  is  achieved.  It  is  shown 
that  the  schemes  ensure  semiglobal  uniform  boundedness  of  all  signals,  and  that  the  ulti¬ 
mate  size  of  the  system  errors  can  be  made  arbitrarily  small.  The  efficacy  of  the  proposed 
schemes  is  illustrated  through  computer  simulations  and  hardware  experiments  with  an 
IMI  Zebra  Zero  manipulator. 

2.  Preliminaries 

This  paper  considers  the  adaptive  compliant  motion  control  problem  for  RLED  ma¬ 
nipulators.  Let  p  €  define  the  position  and  orientation  of  the  manipulator  end-effector 
relative  to  a  fixed  user-defined  reference  frame  and  6  €  denote  the  vector  of  manipulator 
joint  coordinates.  Then  the  forward  kinematic  and  differential  kinematic  maps  between 
the  robot  joint  coordinates  6  and  the  end-effector  coordinates  p  can  be  written  as 

P = h(<>),  p  =  mi>  (i) 

where  h  :  3tn  — ►  is  smooth  and  J  €  3£mXn  is  the  end-effector  Jacobian  matrix. 

Observe  that  there  are  numerous  advantages  to  formulating  the  manipulator  control 
problem  directly  in  terms  of  the  end-effector  coordinates  p.  For  example,  these  coordinates 
axe  typically  more  task-relevant  than  the  joint  coordinates  6,  so  that  developing  the  con¬ 
troller  in  terms  of  p  can  lead  to  improved  performance,  efficiency,  and  implement  ability. 
Note  that  such  a  formulation  is  particularly  useful  for  compliance  control  applications, 
since  the  compliant  motion  objectives  are^  staled  most  naturally  in  terms  of  task-space 
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variables.  If  the  manipulator  is  nonredundant  (so  that  m  =  n)  and  is  in  a  region  of  the 
workspace  where  J  has  full  rank,  then  p  and  9  are  diffeomorphic  and  this  formulation 
presents  no  difficulties.  A  task-space  formulation  can  also  be  realized  if  the  manipulator  is 
kinematically  redundant  (so  that  m  <  n)  by  utilizing,  for  example,  the  configuration  con¬ 
trol  approach  [e.g.,  18,19].  In  what  follows,  we  shall  consider  nonredundant  and  redundant 
robots  together  and  introduce  a  set  of  n  task-space  coordinates  x  obtained  by  augmenting 
p  with  n  —  m  kinematic  functions  that  define  some  auxiliary  task  to  be  performed  by  the 
manipulator  [18,19].  To  retain  generality,  we  shall  require  only  that  the  kinematic  rela¬ 
tionship  between  9  and  x  is  known  and  smooth  and  can  be  written  in  a  form  analogous  to 
(1): 

x  =  hB(0),  x  =  Ja{9)9  (2) 

where  ha  :  -»  and  Ja  £  ftnXn.  Observe  that  for  x  to  be  a  valid  task-space 

coordinate  vector  the  elements  of  x  must  be  independent  in  the  region  of  interest;  thus  it 
will  be  assumed  in  our  development  that  Ja  is  of  full  rank. 

Consider  an  n  DOF  rigid-link  manipulator  actuated  by  armature-controlled  DC  mo¬ 
tors.  Let  T  £  denote  the  vector  of  associated  joint  torques,  u  £  represent  the  vector 
of  armature  voltages,  and  I  £  3£n  define  the  vector  of  armature  currents.  The  dynamic 
model  for  this  RLED  manipulator  system  is  a  3nth  order  differential  equation  relating  the 
joint  coordinates  9  and  the  system  control  input  u.  For  example,  in  the  event  that  9,  9, 
and  T  are  measurable,  the  following  dynamic  model  for  the  manipulator/actuator  system 
is  convenient  for  subsequent  analysis  [e.g.,  5]: 

T  =  He{9)9  +  Ve(9, 9)9  +  G  e(9)  +  J?{9)  P  (3a) 

u  =  MT  +  RT  +  K9  (zb) 

where  He  €  3£nxn  is  the  system  inertia  matrix,  Ve  £  3£nXn  and  Ge  £  quantify  Corio¬ 
lis/centripetal  acceleration  and  gravity  effects,  respectively,  P  £  3£n  is  the  vector  of  forces 
and  moments  exerted  by  the  end-effector  on  the  environment.,  and  M,R,K  £  3£nXn  are 
positive,  constant,  diagonal  matrices  which  characterize  the  actuator  dynamics.  Alterna¬ 
tively,  if  9 ,  9,  and  I  are  measurable,  then  the  following  equivalent  representation  of  the 
system  dynamics  is  typically  more  suitable  for  controller  design  [5]: 

KtI  =  He(9)9  +  V$(9, 9)9  +  Gfl(0)  +  Jj(9) P  (4a) 

u  =  MK'j'i  -f-  RK'j'I.  -\-  K 9  (4&) 

where  Kx  G  9?nXn  is  the  positive,  constant,  diagonal  matrix  which  relates  current  and 
torque  via  T  =  A7T.  Note  that,  for  conver&eifSe  of  development,  we  shall  assume  in  what 
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follows  that  the  environment  is  linearly  elastic  (so  that  the  magnitude  of  P  is  proportional 
to  the  environmental  deformation),  although  our  results  remain  valid  in  more  general 
situations. 

It  is  useful  for  our  subsequent  analysis  to  represent  the  dynamics  (3)  in  terms  of  the 
task-space  coordinates  x.  Developing  such  a  representation  is  standard  [e.g,  20],  and  the 
resulting  task-space  model  can  be  written  as 

F  =  H±  +  Vcck  +  G  +  P,  T  =  jjF  (5a) 

u  =  MT  +  RT  +  K6  (56) 

where  F  €  is  the  generalized  force  associated  with  x  and  H,  Vcc ,  G  are  the  task- 
space  counterparts  to  He,  Vg,  Gg  in  (3a).  It  is  well  known  that  the  rigid-link  manipulator 
dynamics  (5a)  possesses  considerable  structure.  For  example,  for  any  set  of  generalized 
coordinates  x,  the  dynamic  model  terms  if,  G  are  bounded  functions  of  x  whose  time 
derivatives  H,  G  are  also  bounded  in  x  and  depend  linearly  on  x,  the  matrix  if  is  symmetric 
and  positive-definite,  the  matrix  Vcc  is  bounded  in  x  and  depends  linearly  on  x,  and  the 
matrices  H  and  Vcc  are  related  according  to  H  =  Vcc  +  V^.  Additionally,  Vcc(x,x)y  = 
Fcc(x,y)x  Vy,  and  if  y  and  y  are  bounded  then  Vcc(x,  y)  is  bounded  and  Vcc(x,y)  grows 
linearly  with  x  [e.g.,  20].  All  of  these  properties  will  prove  useful  for  controller  development. 

In  this  paper  we  shall  address  both  the  impedance  control  problem,  in  which  the  control 
objective  is  to  regulate  the  dynamic  character  of  the  end-effector/environment  interaction, 
and  the  position/force  control  problem,  in  which  explicit  and  independent  control  of  end- 
effector  position  and  contact  force  is  desired.  In  what  follows,  it  is  assumed  that  8,  8,  T, 
and  P  are  measurable,  so  that  the  control  strategies  will  be  developed  for  the  dynamical 
system  (5).  It  should  be  noted,  however,  that  the  design  procedure  developed  here  is 
directly  applicable  to  the  task-space  version  of  the  dynamical  system  (4)  for  the  case  in 
which  9,9, 1,  and  P  can  be  measured. 

Observe  that  the  dynamic  model  (5)  consists  of  two  cascaded  dynamical  systems.  One 
consequence  of  this  structure  is  that  the  joint  torque  T  cannot  be  commanded  directly,  as  is 
assumed  in  the  design  of  controllers  at  the  torque  input  level,  and  instead  must  be  realized 
as  the  output  of  the  actuator  dynamics  (5b)  through  proper  specification  of  the  control 
input  u.  The  structure  of  the  RLED  manipulator  dynamics  (5)  suggests  partitioning  the 
control  system  design  problem  into  two  subproblems: 

manipulator  control:  regard  T  as  the  control  input  for  the  subsystem  (5a)  and  specify 
the  desired  evolution  of  this  variable  T<f(f)  in  such  a  way  that  if  T  =  then  the  given 
control  objective  would  be  achieved 

actuator  control:  specify  the  actual  contJoffnput  u  so  that  T  closely  tracks 
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This  approach  to  controller  design  is  adopted  in  this  paper,  so  that  each  of  the  proposed 
control  systems  consists  of  two  subsystems:  an  adaptive  strategy  that  provides  the  (fic¬ 
titious)  control  input  required  to  ensure  that  the  system  (5a)  evolves  as  desired,  and 
an  adaptive  scheme  that  determines  the  (actual)  control  input  u  which  guarantees  that 
the  system  (5b)  evolves  with  T  closely  tracking  T<f.  It  is  stressed  that,  although  each 
of  the  subsystems  is  essentially  designed  separately,  it  will  be  explicitly  verified  that  the 
subsystems  can  be  combined  to  yield  a  complete  control  system  with  the  desired  stability 
and  convergence  properties. 

Our  approach  to  the  design  and  analysis  of  suitable  controllers  is  based  on  Lyapunov 
stability  theory.  The  following  ultimate  boundedness  lemma  summarizes  the  properties  of 
a  certain  class  of  Lyapunov  functions  and  will  be  of  direct  relevance  in  our  development. 

Lemma  UB:  Consider  the  coupled  dynamical  system  Xj  =  fi(xi ,  X2,  f),  X2  =  f2(xi ,  X2 ,  t). 
Let  V(x1,x2,t)  be  a  Lyapunov  function  for  the  system  with  the  properties 

Ai  ||  xi  ||2  +A2  ||  x2  ||2<  V  <  A3  ||  xi  ||2  +A4  ||  x2  ||2 
V  <  — A5  ||  xj  ||2  — A6  ||  x2  ||2  +e 

where  e  and  the  Aj  are  positive  scalar  constants.  Define  8  =  max^/As,  A4/A6)  and 
ri  =  (£e/Ai)1</2  for  i  =  1,2.  Then  for  any  initial  state  xi(0),x2(0)  the  system  will  evolve 
so  that  X}(f),  x2(f)  are  uniformly  bounded  and  converge  exponentially  to  the  closed  balls 
■®ri  ?  ,  respectively,  where  Br(  =  {x,-  :  ||  x*  ||<  rj}. 

Proof:  Straightforward  apphcation  of  the  global  exponential  convergence  theorem  of  Cor- 
less  [21]  gives  the  result  (see  also  [22]).  ■ 

3.  Adaptive  Impedance  Controller 

In  this  section  we  develop  an  adaptive  impedance  control  system  for  uncertain  RLED 
manipulators.  When  the  end-effector  of  the  manipulator  is  in  contact  with  the  environ¬ 
ment,  the  objective  of  impedance  control  is  to  cause  the  end-effector  to  respond  to  contact 
forces  according  to  some  user-defined  dynamics.  For  example,  the  desired  dynamic  rela¬ 
tionship  between  the  end-effector  position  x,  the  end-effector  reference  trajectory  xr  £  9£n, 
and  the  end-effector/environment  contact  force  P  is  often  specified  as  follows: 


-^imp(Xr  x)  ~b  (-'impip^-r  x)  ”t~  Kimp(xr  x)  —  P 


(6) 


where  xr  is  bounded  with  bounded  derivatives  and  the  matrices  M,-mp ,  Cimp ,  Kimp  €  3£nXn 
are  specified  by  the  user  so  that  the  dynamics  (6)  possesses  the  desired  characteristics.  For 
instance,  these  matrices  are  typically  chosen  ^  be  constant  and  diagonal,  in  which  case 
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the  dynamical  system  (6)  represents  n  uncoupled  second-order  systems  in  the  n  task-space 
coordinates.  It  can  be  seen  from  (6)  that,  in  the  absence  of  environmental  contact,  we  have 
P  =  0  and  x(t)  -  xr(<)  Vi  (assuming  x  =  xr  initially),  which  corresponds  to  free-space 
trajectory  tracking.  In  the  presence  of  contact,  the  desired  response  is  for  the  end-effector 
to  comply  with  the  contact  forces  in  the  manner  defined  by  the  impedance  model  (6). 

The  proposed  impedance  control  system  consists  of  two  subsystems:  a  simple  filter 
which  characterizes  the  desired  performance  (6),  and  an  adaptive  task-space  control  scheme 
which  generates  the  control  input  u  to  the  RLED  manipulator  system  (5)  that  ensures  this 
desired  performance  is  realized.  In  what  follows,  the  proposed  control  system  is  presented 
and  analyzed,  and  then  simulation  results  illustrating  its  performance  are  described. 

3.1  Control  Scheme 

Observe  that  the  impedance  control  objective  can  be  realized  if  the  end-effector  posi¬ 
tion  x  closely  tracks  the  desired  impedance  trajectory  xd  €  3£n,  defined  as  the  solution  to 
the  differential  equation: 

MimpX-d  4"  CimpXd  4"  KimpX-d  ~  P  4~  MimpXr  -f-  CimpXr  -f-  KimpXr 

xd(0)=xr(0)  ,  xd(0)  =  xr(0)  (7) 

obtained  from  the  impedance  model  (6).  Equation  (7)  may  be  interpreted  as  defining  a 
simple  filter  which,  given  definitions  for  xr(i),  Mtmp,  Cimp,  and  Kimp  and  on-line  measure¬ 
ments  P ,  characterizes  the  desired  dynamic  relationship  between  end-effector  position  and 
contact  force  through  the  specification  of  xd.  The  function  of  the  task-space  impedance 
controller  is  to  cause  the  RLED  manipulator  system  (5)  to  evolve  in  such  a  way  that  the 
end-effector  position  x  closely  tracks  xd  defined  in  (7). 

Let  e  =  xd-x  denote  the  trajectory  tracking  error  and  E  =  Td-T  represent  the  torque 
tracking  error.  Consider  the  following  adaptive  tracking  scheme  for  RLED  manipulators: 

Fd  =  A(t)xd  +  f(i)  +  P  +  ki  72w  +  k2  72e 

w  =  — 27W  +  72e 

Ti  —  Ja  Fd  (8) 

u  =  [a(t)Td]  +  [b(i)T]  +  [c(i)0]  +  ka  E  +  J-1  q 

where  the  notation  [gh]  =  [g1h1,g2h2,  ...,gnhn]T  e  (for  any  two  n-vectors  g,h)  is 
introduced,  q  =  e  +  k2e/kij  —  w/7  is  the  weighted  and  filtered  position- velocity  error, 
kj,  k2 ,  7,  ka  are  positive  scalar  constants,  and  f(t),  a(t),  b(t), c(t)  €  5ft”,  A(t )  G  3?nXn  are 
adaptive  gains  which  are  adjusted  according  to  the  following  simple  update  laws: 

f  =  -iWiq 
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(9) 


A  =  -a2A  +  /32qxJ 
a  =  —03a  +  /?3[ET\f] 
b  =  —(74b  +  /?4[ET] 
c  =  -cr5c  +  /95[E  9] 

where  the  Oi  and  /?;  are  positive  scalar  adaptation  gains.  Inspection  of  (7)-(9)  reveals 
that  both  Td  and  T<i  can  be  easily  and  efficiently  computed  using  only  measurements  of 
manipulator  joint  position  and  velocity  and  end-effector  contact  force.  As  a  consequence, 
the  complete  controller  is  computationally  efficient  and  readily  implementable  using  mea¬ 
surements  of  6 ,  6,  T,  and  P  (recall  that  the  same  design  procedure  can  be  used  for  the 
case  in  which  6,  6, 1,  and  P  are  measurable). 

The  stability  properties  of  the  proposed  adaptive  impedance  controller  (7)-(9)  are 
summarized  in  the  following  theorem. 

Theorem  1:  The  adaptive  scheme  (7)-(9)  ensures  that  (5)  evolves  with  all  signals 
(semiglobally)  uniformly  bounded  provided  7  is  chosen  sufficiently  large.  Moreover,  the 
impedance  control  error  e,  e  is  guaranteed  to  converge  exponentially  to  a  compact  set 
which  can  be  made  arbitrarily  small. 

Proof:  Applying  the  control  law  (8)  to  the  manipulator  dynamics  (5)  yields  the  closed-loop 
error  dynamics 

He  +  Vcce  +  fcj 72w  +  k2 72e  +  +  $A*d  +  Vc<ie  -  J~T E  =  0 

[mE]  +  koE>  +  Ja  1q  +  [<^aTd]  +  [<^T]  +  \(f>c6\  =  0  (10) 

where  m,  r,  k  €  are  formed  by  stacking  the  n  diagonal  elements  of  M,  R,  K  into  vectors, 
<f>a  =  a  —  m,  =  b  —  r,  <f>c  =  c  —  k,  =  f  —  Hxd  —  G,  $,4  —  A  —  Vcd,  and  the  notation 
Vcd  =  Vcc(x,  Xi)  is  introduced.  Note  that  in  obtaining  (10)  the  Coriolis/ centripetal  term 
was  expanded  as  follows:  Vcckd  =  KdX  =  Vcikd  —  Vcde. 

Consider  the  Lyapunov  function  candidate 

V\  =  \-eTHe  +  ^A:272eTe  +  ^kiwTw  +  ~-eTHe  -  -wTHe  +  ^ET[mE] 

+ + + + k** + wf^  (11) 

and  note  that  V\  is  a  positive-definite  and  radially-unbounded  function  of  the  closed-loop 
system  state  if  7  is  chosen  sufficiently  large.  Computing  the  derivative  of  (11)  along  (10), 
and  simplifying  using  (9)  and  the  definition  for  w,  yields 

v.  =  ~(l  -  ^  II  e  f  2 II  W  II2  -K  II  E  II2 
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+  2wtJ ?e  +  -eT[-^-Vcce  -  Vccw]  -  qTVcde 

1  h 

+  M  +  J-te[$A{$A  -  ^qxJ)T] 

+  -  /33 [ETd])  +  ^f(i»  -  &[ET])  +  ^(4  -  &N) 

<  -(( 5  -  -^)A-(B')  -  M  II  e  II2  II  e  ||2  -4,7  II  w  ||2  -k.  ||  E  ||2 

+  (2Aj \t{H)  H - 1 - )  ||  w  || ||  e  |j  +7 — ( kccVM  +  kcli)  ||  e  ||||  e  || 

7  7  fci7 

+  ^  II  e  nil  e  II2  II  w  nil  e  ||2  II  */  l|2  II  *a  l|2f 

-  ||  e  ||2  ||  */  II2  II  e  1111  */  II  +j|  II  */  II 

-  | A m(H)  ||  e  ||2  ||  *A  fF  +|  II  e  1111  *A  IIf  +j|  II  Hr 


*3 

Pz 


ft  "  *  "2  "I  " 


|2  ,  V5_  Jfc  V7_ 
1  +  /?3+/34  /?5 


(12) 


where  Am(-),  A^(-)  denote  the  minimum  and  maximum  eigenvalue  of  the  matrix  argument, 
respectively,  ||  •  \\F  is  the  Frobenius  matrix  norm,  kcc  satisfies  ||  Vcc  ||F  <  kcc  ||  x  ||  Vx,  kcd 
is  an  upper  bound  on  Vcd  (i.e.,  ||  Vcd  ||F  <  kcd  Vx),  vm  is  an  upper  bound  on  ||  x<j  ||,  and 
the  r)i  are  positive  scalar  constants.  Note  that  the  existence  of  vmax  can  be  concluded  by 
examining  (7)  and  noticing  that  xr,  xr,  xr  are  bounded  by  definition  and  P  is  bounded 
because  the  workspace  of  the  manipulator  is  assumed  to  be  finite  (so  that  the  environmental 
deformation  must  be  finite)  [22,23]. 

Observe  that  the  following  inequalities  can  be  obtained  through  routine  manipulation: 


-jA„(F)  ||  e  ||2 


<7\ 


-jA  m(H) 


2ft 

a2  „  ^  ,l2 


I2  +-1 


Pi 


*/ll  + 


r\2_ 

Pi 


*/  II  <  - 


Ul 

Pi 


2P2  "  **  »  +| 


*a  ||f  +f2  ||  $a  ||f  <  | 


where  r]$,  rj 9  are  positive  scalar  constants  which  do  not  increase  as  the  Pi  axe  increased. 
These  inequalities  permit  the  following  upper  bound  on  V\  in  (12)  to  be  established: 


Vi  <  -«!  -  -^)A,»(tf)  -  M  II  e  II2 

+  (2A„(F)  +  hi  +  h£«)||w||||e||. 
7  7 


-hi 


w 


—k„  II  E 


k2 


hi 


(kCcvM  "b  ^cd)  ||  ®  ® 


+ 


k2kcc 

hi 

Hi 
Pz 


k 

|2  |  ^cc 

7 


w 


<?i 


-^H.f  —  Wfor—MeU2  +?■  +  ^^  +  y  +  lL  +  ’', 


<75 

>5 


2  Pi 

Hi 

'Pi 


<72 


P2 


2  P2 

5-  + 

Pz  Pi 


II  || 


/?5 


(13) 
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Let  Zl  =  [||  e  ||  ||  e  ||  ||  w  ||]T,  *  =  [||  ||  ||  $j4  ||F  ||  ^  ||  ||  ^  y  y  ^  y]T 

0min  mm(/^i)5  0max  ~~  max(^j),  3.11(1 


Q*  = 


■2k^(kccVM  +  hd) 
0 


2k^y(^ccvM  4"  &cd) 

(2  _  ±L 
V  2  k-t 


0 


pci  7 


)A m(H)  -  kcd 


-a  M(H)~ 


feed 

27 


~VM 


27 


-A m(H)  -  27 

hi 


hx,cvM 

27 


and  note  that  Q*  is  positive-definite  if  7  is  chosen  large  enough.  With  these  definitions 
the  following  bound  on  V\  in  (13)  can  be  derived: 


Vi  <  -  MQ*)  II  Zl  II2  -K 

h  kcc  11  „H  . 


E 


mo 


+ 


hi 


^cc 

7 


w 


0min 
II  A  1 1 2 


^ll 


(14) 


where  T]10,  rjn  are  scalar  constants.  Finally,  let  z2  =  [||  Zi  ||  ||  E  ||]T  and  Q  = 

diag[Am(<3  )?^aj-  Then  if  0  m  ax  /  ft  min  is  fixed  there  exists  a  positive  scalar  constant  t)  12 
that  does  not  increase  as  7  and  0min  increase,  and  positive  scalar  constants  A;  independent 
of  7  and  0mini  such  that  Vi  and  Vi  in  (11)  and  (14)  can  be  bounded  as 


A,  II  z2  ||2  +A-  ||  «r  |p <  Vi  <  A3  II  z2  IP  +-0-  ||  *  IP 

Pmin  Pmin 

Vx  <  -(A „(Q)  -  ^U1/2)  ||  z2  |p  -A-  ||  ¥  |p  +^12- 

7  Pmin  Pmin 


Now  choose  two  scalar  constants  VM,Vm  so  that  VM  >  Vm  >  Vi(0),  and  define  cM  = 
Am(Q)  -  m2  VMl,2h]  then  choose  7  large  enough  so  that  cM  >  0  (this  is  always  possible). 
Let  6  =  max(A3 /cm,  A4/A5)  and  choose  0o  so  that  rjuS/fio  <  Vm  (this  is  always  possible). 
Then  selecting  0min  >  0o  ensures  that  if  Vrn  <  Vi  <  Vm  then  V]  <  0.  This  condition 
together  with  VM  >  Vm  >  VrfO)  imphes  that  Vj(f)  <  VM  Vt,  so  that  c(t)  =  Am(Q)  - 
P)i2V/2W/7  >  cm  >  0  Vt  and 


Vi  <  —CM 


Z2 


(00  +  A0) 


*711 


(0o  +  A0) 


where  A0  =  0min  -  0O  and  it  is  assumed  that  0min  is  chosen  so  that  A/3  >  0.  Lemma  UB 
now  applies  and  permits  the  conclusion  that  ||  z2  ||,  ||  ^  ||  are  uniformly  bounded,  which 
imphes  that  e,  e,  w,  E,  f,  A,  a,  b,  and  c  are  uniformly  bounded.  Moreover,  ||  z2  ||,  ||  || 

converge  exponentially  to  the  closed  balls  Bn,  Br2,  respectively,  where 
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Observe  that  the  radius  of  the  ball  to  which  ||  Z2  ||2  is  guaranteed  to  converge  can  be 
decreased  as  desired  simply  by  increasing  A/3.  ■ 

Several  observations  can  be  made  concerning  the  adaptive  impedance  controller  (7)- 
(9).  First  note  that  the  control  strategy  is  simple,  utilizes  only  measurements  of  the 
system  state  and  contact  force,  and  requires  virtually  no  a  priori  information  concerning 
the  manipulator,  the  actuators,  or  the  environment.  Thus  the  proposed  scheme  provides 
a  computationally  efficient,  modular,  and  readily  implementable  approach  to  RLED  ma¬ 
nipulator  compliant  motion  control.  Theorem  1  shows  that  the  controller  ensures  uniform 
boundedness  of  all  signals,  and  that  the  ultimate  bounds  on  the  discrepancy  between  the 
desired  and  actual  performance  can  be  reduced  arbitrarily  by  increasing  the  adaptation 
gains  fii.  Note  that  increasing  these  adaptation  gains  does  not  ordinarily  lead  to  large 
control  action.  To  see  this,  observe  from  the  ultimate  bound  on  ||  ||  that  increasing  the 

does  not  increase  the  size  of  the  difference  between  adaptive  terms  and  model  terms, 
and  therefore  does  not  cause  unwarranted  growth  in  the  control  law  gains.  It  is  mentioned 
that  the  convergence  of  the  system  errors  is  exponential,  which  ensures  that  the  transient 
behavior  of  the  system  will  be  well-behaved.  Finally,  note  that  the  stability  properties 
established  in  Theorem  1  are  semiglobal  in  the  sense  that  the  region  of  attraction  can  be 
increased  arbitrarily  by  increasing  the  controller  gain  7.  Again  it  is  stressed  that  this  does 
not  imply  that  7  must  be  chosen  to  be  overly  large;  indeed,  we  have  found  that  excellent 
performance  is  obtainable  with  this  gain  set  to  quite  modest  values. 

3.2  Simulation  Results 

The  adaptive  impedance  control  scheme  described  above  is  now  applied  to  a  six  DOF 
robot  manipulator  through  computer  simulation.  The  robot  chosen  for  this  simulation 
study  is  the  IMI  Zebra  Zero  robot;  this  manipulator  possesses  a  conventional  design  with 
six  revolute  joints  configured  in  a  “waist-shoulder-elbow-wrist”  arrangement  (see  Figure 
1).  The  simulation  environment  incorporates  models  of  all  important  dynamic  subsys¬ 
tems  and  phenomena,  such  as  the  full  nonlinear  arm  dynamics,  actuator  dynamics,  joint 
stiction,  sensor  noise,  and  transmission  effects,  and  therefore  provides  the  basis  for  a  real¬ 
istic  evaluation  of  controller  performance.  The  control  law  is  applied  to  the  manipulator 
model  with  a  sampling  period  of  two  milliseconds,  and  all  integrations  required  by  the 
controller  are  implemented  using  a  simple  trapezoidal  integration  rule  with  a  time-step  of 
two  milliseconds. 

This  simulation  study  demonstrates  that  the  proposed  impedance  control  scheme  pro¬ 
vides  a  unified  framework  for  unconstrained  and  constrained  motion  control,  and  illustrates 
how  the  scheme  can  accommodate  the  trati!sitd2>n  between  unconstrained  and  constrained 
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end-effector  motion.  In  this  study,  a  frictionless  reaction  surface  with  a  stiffness  of  105  N/m 
is  placed  in  the  robot  workspace.  This  reaction  surface  is  oriented  normal  to  the  x  axis  of 
the  robot  base  frame  and  is  located  at  x  =  1.25m.  The  reference  trajectory  for  end-effector 
translation  is  defined  as  xr(t )  =  1.0-t-0.5sin7rt/4,  yr(i)  =  — 1.0+0. 5(1— cosirt  /  A),  zr(t)  =  0.0 
for  t  €  [0,4].  Additionally,  the  manipulator  is  required  to  maintain  its  initial  end-effector 
orientation.  The  end-effector  reference  trajectory  therefore  consists  of  a  rapid  semicircular 
motion  in  the  x,y  plane  with  a  radius  of  0.5m,  as  shown  in  Figure  2.  Note  that  the  ref¬ 
erence  trajectory  intersects  the  reaction  surface  at  approximately  t  =  0.7s,  so  that  at  this 
point  the  robot  undergoes  an  abrupt  transition  from  unconstrained  to  constrained  motion. 

The  adaptive  impedance  controller  (7)-(9)  is  used  in  this  study.  The  impedance  param¬ 
eters  in  (7)  are  defined  as  Mimp  =  diag[0.5],  Kxrnp  —  diag[100],  and  Cimp  =  diag[2 
observe  that  the  elements  of  Cimp  are  set  to  provide  critical  damping  in  the  impedance 
model.  The  adaptive  gains  f,  A,  a,  b,  c  are  set  to  zero  initially,  while  the  remaining 
controller  parameters  are  set  as  follows:  k\  =  10,  &2  =  20,  7  =  5,  ka  =  100  and  <Ji  =  0.01, 
j3i  =  100  for  i  =  1,2,...,  5.  The  results  of  this  simulation  axe  given  in  Figures  2a  and 
2b.  Figure  2a  indicates  that  the  end-effector .  closely  tracks  the  reference  trajectory  until 
contact  with  the  reaction  surface  is  made.  At  contact,  the  robot  smoothly  ceases  to  track 
the  x  component  of  the  reference  trajectory  in  favor  of  complying  with  the  reaction  sur¬ 
face;  note  that  this  behavior  has  very  little  effect  on  the  robot  motion  in  the  y  direction. 
The  end-effect  or/surface  contact  force  P  is  shown  in  Figure  2b.  It  can  be  seen  that  the 
transient  forces  incurred  during  the  transition  from  unconstrained  to  constrained  motion 
are  well  behaved,  and  that  the  contact  force  follows  a  semicircular  profile,  reflecting  the 
influence  of  the  user-specified  reference  trajectory  on  the  contact  force. 

4.  Adaptive  Position/Force  Controller 

We  now  turn  to  the  development  of  an  adaptive  position/force  controller  for  uncertain 
RLED  manipulators.  Consider  the  manipulator/actuator  system  (5)  interacting  with  a 
linearly  elastic  environment.  In  what  follows  we  shall  model  the  rigid  manipulator  por¬ 
tion  of  this  system  using  the  approach  proposed  in  [23].  Thus  we  suppose  the  existence 
of  a  task  frame  T,  and  let  x  denote  the  (possibly  augmented)  end-effector  coordinates 
relative  to  this  frame.  It  is  easy  to  show  that  all  of  the  “nice”  properties  of  the  rigid- 
link  manipulator  dynamics  (5a)  which  are  summarized  in  Section  2  are  retained  with  this 
definition  for  the  system  task-space  coordinate  vector  x  [24].  Moreover,  in  this  case  the 
end-effector/environment  interaction  can  be  modeled  as 

(x-xe)  '  ‘  (15) 


P  = 


p: 


=  £e(x  2-JW)  = 


Ke 

KL 
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In  (15),  JCe  €  3?nxn  and  xe  €  are  the  environmental  stiffness  and  location,  respectively, 
Pf  €  9£r,  Ke  G  3?rXn  has  independent  rows,  and  the  elements  of  P't  and  rows  of  K'e 
are  linear  combinations  of  the  elements  of  Pt  and  rows  of  Ke ,  respectively;  therefore 
only  the  Pt  component  of  P  can  be  independently  controlled.  Let  K+  =  Kj (KeKj)~' , 
^ e  ~  I  ~  K+Ke,  and  define  K*  G  3?nX^ra  r)  and  x<  G  9£n-r  so  that  the  range  spaces  of 
Ke  and  Ke  are  identical  and  so  that  K* x i  =  Ke  x  Vx.  With  these  definitions  it  is  easily 
verified  that  x  =  K+ Pt  +  K*xt,  so  that  Pt,  xt  completely  define  the  configuration  of  the 
rigid-link  manipulator  (5a)  and  can  be  independently  controlled  [23].  The  complete  RLED 
manipulator  model  (5)  can  be  written  in  terms  of  the  pair  Pt,  x<  as  follows: 

F  =  H(K+ Pt  +  K*xt)  +  VCC(K+Pt  +  K*ext)  +  G  +  P,  T  =  jJ  F  (16a) 

u  =  MT  +  RT  +  K8  (166) 

In  typical  applications  the  environmental  stiffness  matrix  K.e  is  not  accurately  known. 
However,  if  the  environment  is  isotropic  in  addition  to  elastic,  Kt  can  be  parameterized  as 
)Ce  =  keE,  where  ke  is  an  unknown  positive  constant  and  E  G  &nXn  is  easily  determined 
from  the  environmental  geometry.  In  this  case,  direct  calculation  reveals  that  K+  = 
(1  /ke)E+  and  K*  =  E*,  so  that  K*  is  known  and  the  only  uncertainty  associated  with 
Ke  is  an  unknown  constant  coefficient;  it  will  be  shown  below  that  this  latter  uncertainty 
can  be  compensated  with  the  proposed  control  scheme. 

4.1  Control  Scheme 

The  objective  of  the  proposed  position/force  controller  is  to  ensure  that  the  RLED 
manipulator  dynamics  (16)  evolves  so  that  xt  tracks  the  desired  position  trajectory  xtd(t) 
(which  is  bounded  with  bounded  derivatives)  and  Pt  approaches  the  desired  constant  force 
setpoint  Ptd.  Consider  the  following  adaptive  solution  to  this  problem: 

Ed  =  A(t)xtd  +  B(t)xtd  +  f(t)  +  P  +  ki^2w  +  &272e 
w  =  —2jw  +  j2e 
f  =  -o-if  +  Aq 
A  =  — ct2^4  + 

B  =  -a3B  +  03qx[d 

T<*  =  JIFd  (17) 

u  =  [a(f)Td]  +  [b(f)T]  +  [c(i)0]  +  &aE  +  J~l  q 
a=  -^a  +  ^fETd] 
b  =  -a5b  +  yd5[ET] 
c  = -cr6c  +  06[E9]  2-54 
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where  e  =  (l/ke)E+(Ptd  —  P<)  +  E*(xtd  —  xt)  is  the  position/force  error.  Inspection 
of  (17)  reveals  that  both  T<£  and  can  be  easily  and  efficiently  computed  using  only 
measurements  of  manipulator  joint  position  and  velocity  and  end-effector  contact  force. 
As  a  consequence,  the  complete  controller  is  computationally  efficient  and  readily  imple- 
mentable  using  measurements  of  9,  6,  T,  and  P  (recall  that  the  same  design  procedure 
can  be  used  for  the  case  in  which  9 ,  0,  I,  and  P  are  measurable). 

The  stability  properties  of  the  proposed  adaptive  position/force  controller  (17)  are 
summarized  in  the  following  theorem. 

Theorem  2:  The  adaptive  controller  (17)  ensures  that  (16)  evolves  with  all  signals 
(semiglobally)  uniformly  bounded  provided  7  is  chosen  sufficiently  large.  Moreover,  the 
position  error  xtd  —  x*  and  force  error  P td  —  P*  are  guaranteed  to  converge  exponentially 
to  compact  sets  which  can  be  made  arbitrarily  small. 

Proof:  Applying  the  control  law  (17)  to  the  RLED  manipulator  dynamics  (16)  yields  the 
closed-loop  system  dynamics 

He  +  Vcce  +  h 72w  +  k2 72e  +  $*f  +  +  $B*td  +  Vcde  -  J~T E  =  0 

[mE]  +  ka  E  +  Ja  1  q  +  [<^aT<i]  +  [<^T]  +  \<j>c9 ]  =  0  (18) 


where  =  f  —  G,  —  A  —  HE *,  =  B  —  VcdE*,  and  all  other  terms  are  defined  as 

before.  Consider  the  Lyapunov  function  candidate 


1-Ttt  a,  1U  At  ,  h  JT--  1  -r  — •  1 


V2  =  --e1  He  +  ^krfe^e  +  ^jWTw  + 

11  2  A77 


e1  He  —  —  w 1  He  +  ^ET[mE]  (19) 

7  2 


+  + \ tllTA'A<t*  +  s4®*®1  +  ~aTa  "  ~aTj" 


and  note  that  V2  is  a  positive-definite  and  radially-unbounded  function  of  the  closed-loop 
system  state  if  7  is  chosen  sufficiently  large.  Computing  the  derivative  of  (19)  along  (18) 
and  simplifying  as  in  the  proof  of  Theorem  1  yields  the  following  upper  bound  on  V2 : 


V2<-\  m(Q*) 
k2kcc 


+ 


hi 


e  II2 


h  II  E  II2 

k 

1 


Viz 


0 


w 


mm 

2 


\J>*  ||2 

Vl4 

ftmin 


(20) 


where  —  [||  <&*f  ||  ||  $*A  ||F  ||  ||F  ||  <f>a  ||  ||  4>b  ||  ||  <f>c  ||]T  and  tj13,  r/14  are  scalar 

constants. 

Now  arguments  identical  to  the  ones  used  in  the  proof  of  Theorem  1  can  be  used  to 
show  that  e,  e,  w,  E,  f,  A ,  B,  a,  b,  and  c  are  uniformly  bounded,  and  that  the  ultimate 
size  of  the  ball  to  which  e  and  e  are  guarantied  to  converge  can  be  decreased  as  desired 
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simply  by  increasing  the  /?,.  Observe  that  this  latter  fact  implies  that  the  ultimate  size  of 
the  position  error  xtd  -  xt  and  force  error  -  Pt  can  be  reduced  arbitrarily  as  well.  To 
see  this,  note  that  small  e  implies  that  Kee  =  Ptd  -  Pt  is  small,  and  this  in  turn  implies 
that  xtd  —  xt  is  small.  ■ 

Several  observations  can  be  made  concerning  the  proposed  adaptive  position/force 
controller  (17).  First  note  that  the  control  strategy  is  simple,  utilizes  only  system  state 
and  force  measurements,  and  requires  virtually  no  information  concerning  either  the  ma¬ 
nipulator/actuator  system  dynamics  or  the  environment.  In  particular,  it  is  noted  that 
knowledge  of  the  environmental  stiffness  is  not  required  for  implementing  the  scheme,  de¬ 
spite  the  appearance  of  ke  in  the  definition  of  position/force  error  e.  To  see  this,  notice 
that  e  always  appears  with  constant  coefficients  in  the  control  strategy,  so  that  ke  can 
always  be  “absorbed”  into  these  constants  for  implementation.  Thus  the  scheme  provides 
a  computationally  efficient,  modular,  and  readily  implementable  approach  to  RLED  ma¬ 
nipulator  compliant  motion  control.  Theorem  2  shows  that  the  controller  ensures  uniform 
boundedness  of  all  signals,  and  that  the  ultimate  bounds  on  the  discrepancy  between  the 
desired  and  actual  performance  can  be  reduced  arbitrarily  by  increasing  the  adaptation 
gains  fa.  Thus  all  of  the  observations  made  regarding  the  impedance  controller  (7)-(9) 
following  the  proof  of  Theorem  1  apply  to  the  position/force  control  scheme  (17)  as  well. 

Finally,  it  is  interesting  to  observe  that,  if  xtd  is  a  constant  position  setpoint,  then  the 
proposed  adaptive  controller  (17)  yields  asymptotic  position/force  regulation.  This  result 
is  established  in  the  following  corollary  to  Theorem  2. 

Corollary:  Suppose  that  the  desired  position  trajectory  Xtd  is  a  constant  setpoint.  Then, 
if  the  (Ji  are  set  to  zero,  the  adaptive  controller  (17)  ensures  that  x%  — >  x%d  and  Pt  — *  P td 
as  t  —*■  oo. 

Proof:  Applying  the  control  law  (17)  (with  xtd  =  xtd  =  0)  to  the  manipulator  dynamics 
(16)  yields  the  closed-loop  system  dynamics 

He  +  Fcce  +  k^w  +  k2 j2e  +  $d  +  G(xtd )  -  G(xt)  -  J“rE  =  0 
[mE]  +  ka E  +  Ja  *q  +  [<^aTd]  +  [<^T]  +  [<t>c6\  =  0  (21) 

where  $d  =  f  —  G(x<</).  Let  U (x<)  denote  the  manipulator  gravitational  potential  energy. 
Consider  the  Lyapunov  function  candidate 

Vz  =  \*THe  +  ^k2'j2eTe  +  |fciWTw  +  -j^-eTHe  -  ^wTIfe  +  |ET[mE]  (22) 

+  2 +  2jh^(t>a  +  ~  U(xm)  +  GT(xtd)e 


15 


and  note  that  V3  is  a  positive-definite  and  radially-unbounded  function  of  the  closed-loop 
system  state  if  7  is  chosen  sufficiently  large  (for  example,  if  7  is  chosen  large  enough  then 

the  term  t272ere/ 2  +  V(xt)  -  U(xtd)  +  GI'(xtJ)e  can  be  shown  to  be  positive-definite  in 
e). 

Differentiating  (22)  along  (21)  and  simplifying  as  in  the  proof  of  Theorem  2  yields 
Vs  =  -(7  ~  ^  ||  e  ||2  -^7  ||  w  ||2  -ka  ||  E  ||2  +2wTtfe 

+  7T|7F“e  -  v“w]  +  -  Aq)  +  -  A[etj]) 

+  -  &[ETj)  +  -  /36[E0])  +  (G(x,)  -  G(xtd))(-~e  -  )w) 

£  11  "  l|2  ^  -  I7)  II  e  I'2  ^  II  w  II2  "*•  II  E  II2  (23) 

+  2A „(H)  ||  w  I  e  ||  ||  w  HU  e  ||  + ^  ||  e  ||||  e  f  ||  w  ||||  e  ||2 

where  M  is  a  positive  scalar  constant  satisfying  M  ||  xtd-xt  ||>||  G(x«)-G(xt)  ||  Vxti,xt 
(the  boundedness  of  the  partial  derivatives  of  G  ensures  that  such  an  M  exists).  Define 
Q*c  as 

"Ml  _  bM  n  _  M  ‘ 

fci  *17  u  27 

3*  =  0  (7_^_)Am(F) 

.  ki-y 

(with  Q*  positive-definite  if  7  is  chosen  large  enough)  and  let  Qc  =  diag[Am(Q*),  ka ].  Then 
the  following  bound  on  V3  in  (23)  can  be  obtained  through  routine  manipulation: 

1>3  <  -(A .„((?«)  -  ||  z2  ||2  (24) 

where  t]15  is  a  scalar  constant  which  does  not  increase  as  7  is  increased.  Let  cc(t)  — 
^m(Qc)  ~  fhhVz1^{t)h  and  choose  7  large  enough  so  that  cc(0)  >  0  (this  is  always  pos¬ 
sible).  Then  (24)  implies  that  cc(0)  <  cc(t)  Vt,  so  that  V3  <  -cc(0)  ||  z2  ||2.  Standard 
arguments  can  then  be  used  to  show  that  all  signals  are  bounded  and  that  z2  converges  to 
zero  (and  therefore  that  e  converges  to  zero)  [25].  Finally,  note  that  the  convergence  of  e 
to  zero  implies  that  P<  — »  P t<i-  which  in  turn  implies  that  xt  —*  xt(j.  ■ 

4.2  Experimental  Results 

The  performance  of  the  proposed  adaptive  position/force  control  strategy  is  now  stud¬ 
ied  in  an  experimental  investigation.  The  experimental  facility  utilized  for  this  study  is  the 
New  Mexico  State  University  Robotics  LabSreSbry  and  consists  of  an  IMI  Zebra  Zero  robot 
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arm  and  the  associated  control  computer  (see  Figure  1).  All  control  software  is  written 
in  ‘C’  and  is  hosted  on  an  IBM-compatible  486  personal  computer.  Note  that  the  Zebra 
robot  is  one  of  the  few  commercially  available  manipulators  with  a  force/torque  sensor 
integrated  directly  into  their  design;  this  fact  motivated  the  selection  of  this  robot  for  both 
the  simulation  and  experimental  investigations  presented  in  this  paper. 

The  proposed  adaptive  position/force  control  scheme  (17)  is  implemented  in  the 
present  experimental  study.  The  task  specified  in  the  experiment  requires  the  manipu¬ 
lator  to  exert  a  user-defined  contact  force  on  the  environment  while  tracking  a  position 
trajectory  on  the  constraint  surface.  Two  different  environments  are  utilized  in  the  exper¬ 
iments.  a  soft  environment  consisting  of  a  large  desk  blotter  placed  on  the  laboratory 
table,  and  a  “stiff” environment  obtained  by  mounting  a  metal  plate  on  the  laboratory 
table.  In  each  experiment,  the  reaction  surface  is  placed  in  the  robot  workspace  so  that 
it  is  oriented  normal  to  the  z  axis.  The  manipulator  is  commanded  to  exert  a  contact 
force  of  10. ON  on  the  reaction  surface,  maintain  its  initial  end-effector  orientation,  and 
track  the  following  desired  position  trajectory  in  the  x,y  plane:  xd(t)  =  1.0  +  0.5sin7rt/4, 
Vd(t)  =  —1.0  +  0.5(1  —  cos7rt/4)  for  t  £  [0,4].  The  desired  position/force  control  task  is 
therefore  to  exert  a  constant  force  normal  to  the  reaction  surface  while  tracking  a  semicir¬ 
cular  trajectory  on  this  surface. 

The  experiments  are  designed  to  evaluate  the  effectiveness  of  the  proposed  controller 
(17)  for  simultaneously  controlling  end-effector  position  and  contact  force,  and  to  examine 
the  sensitivity  of  the  controller  to  a  large  variation  in  environmental  stiffness.  For  all 
experiments,  the  controller  gains  are  set  to  ki  =  10,  k2  =  20,  7  =  5,  and  ka  =  100.  All  of 
the  adaptive  elements  are  initialized  to  zero,  and  the  adaptation  gains  are  set  as  follows: 

=  10,  fa  —  03  —  04  =  /?5  =  Pe  =  15  and  (Ti  =  0.1  for  i  =  1,2, ...,  6  (for  all  gains,  the 
units  for  force  and  length  are  Newton  and  centimeter,  and  e  is  computed  by  “normalizing” 
the  force  error  component  through  division  by  1000).  The  adaptive  controller  is  applied  to 
the  robot  with  a  sampling  period  of  seven  milliseconds,  and  all  integrations  required  by  the 
controller  are  implemented  using  a  simple  trapezoidal  integration  rule  with  a  time-step  of 
seven  milliseconds.  The  results  of  these  experiments  are  given  in  Figures  3a-3c,  with  Figure 
3a  depicting  the  position  trajectory  tracking  performance  for  the  stiff  environment  (the 
soft  environment  performance  was  virtually  identical),  Figure  3b  showing  the  force  control 
response  for  the  soft  environment,  and  Figure  3c  illustrating  the  force  control  response  for 
the  stiff  environment.  It  can  be  seen  from  the  plots  that  the  desired  position  trajectory  is 
closely  tracked  and  the  desired  force  setpoint  is  accurately  maintained,  despite  the  lack  of 
a  priori  information  concerning  the  manipulator  and  the  environment.  Additionally,  it  is 
observed  that  the  performance  is  robust  to  l^rg<|  variations  in  environmental  characteristics 
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(desk  blotter  and  steel  plate). 


5.  Conclusions 

This  paper  presents  two  new  adaptive  compliant  motion  control  schemes  for  uncertain 
RLED  manipulators.  It  is  shown  that  the  schemes  ensure  semiglobal  uniform  boundedness 
of  all  signals,  and  that  the  ultimate  size  of  the  system  errors  can  be  made  arbitrarily  small. 
The  capabilities  of  the  proposed  schemes  are  illustrated  through  both  computer  simula¬ 
tions  and  laboratory  experiments  with  an  IMI  Zebra  Zero  manipulator.  These  studies 
indicate  that  the  adaptive  impedance  controller  provides  an  accurate  and  robust  means 
of  realizing  the  desired  end-effector  impedance  characteristics,  and  that  the  adaptive  posi¬ 
tion/force  control  scheme  represents  an  effective  method  of  controlling  end-effector  position 
and  force  independently.  Future  work  will  address  the  problem  of  controlling  uncertain 
RLED  robotic  systems  when  they  are  subjected  to  more  general  classes  of  contraints. 
Progress  in  this  direction  for  the  special  case  of  rigid  manipulators  performing  waste  man¬ 
agement  operations  are  reported  in  [26]. 
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Figure  2b:  End-Effector/environment  contact  force  in  impedance  control  simulation  ex¬ 
ample. 


position/force  control  experim^n#3 


Figure  3b: 


End- Effector /environment  contact  force  in  position/force  control  experiment 
(soft  environment). 
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Figure  3c:  End-Effector/environment  contact  force  in  position/force  control  experiment 


(stiff  environment). 
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Abstract 

This  paper  considers  the  motion  control  problem  for  uncertain  robot  manipulators  for 
the  case  in  which  only  joint  position  measurements  axe  available,  and  proposes  an  adaptive 
controller  as  a  solution  to  this  problem.  The  proposed  control  strategy  is  very  general 
and  computationally  efficient,  requires  very  little  information  regarding  the  manipulator 
model  or  the  payload,  and  ensures  that  the  position  error  possesses  desirable  convergence 
properties:  semiglobal  asymptotic  convergence  if  no  external  disturbances  are  present  and 
semiglobal  convergence  to  an  arbitrarily  small  neighborhood  of  zero  in  the  presence  of 
bounded  disturbances.  It  is  shown  that  the  adaptive  controller  can  be  modified  to  provide 
accurate  tracking  control  through  the  introduction  of  additional  feedforward  elements  in 
the  control  law.  Computer  simulation  results  are  given  for  a  PUMA  560  robot  and  a  Zebra 
Zero  manipulator;  these  results  demonstrate  that  accurate  and  robust  position  regulation 
and  trajectory  tracking  can  be  achieved  by  using  the  proposed  controller.  Experimental 
results  axe  presented  for  a  Zebra  Zero  manipulator  and  confirm  that  the  proposed  approach 
provides  a  simple  and  effective  means  of  obtaining  high  performance  motion  control. 

1.  Introduction 

The  position  regulation  problem  for  robot  manipulators  involves  designing  a  control 
strategy  to  cause  the  manipulator  to  evolve  from  its  initial  state  to  some  desired  final  state, 
where  typically  the  final  velocity  is  to  be  zero.  This  is  also  referred  to  as  the  position  control 
or  point-to-point  motion  control  problem,  emphasizing  the  fact  that  only  the  final  position 
is  specified  and  the  intermediate  trajectory  is  not  stipulated.  It  is  well-known  that  simple 
proportional-derivative  (PD)  feedback  controllers  axe  capable  of  globally  asymptotically 
stable  regulation  of  rigid  robots,  provided  that  the  effects  of  gravity  on  the  manipulator 
are  compensated.  Ordinarily  the  requisite  gravity  compensation  is  achieved  by  including  a 
gravity  model  in  the  control  scheme,  so  that  the  resulting  controller  possesses  a  PD-plus- 
gravity  structure  [1]. 

While  the  PD-plus-gravity  position  cdntj?3l  law  is  simple,  elegant,  and  intuitively  ap- 


pealing,  there  are  two  potential  difficulties  associated  with  this  approach.  First,  providing 
gravity  compensation  by  including  a  model  of  the  manipulator  gravity  torques  in  the  con¬ 
trol  law  can  be  undesirable  because  this  requires  precise  a  priori  knowledge  of  both  the 
structure  and  the  parameter  values  for  this  model,  including  the  effects  of  any  payload. 
Particularly  restrictive  is  the  need  for  information  concerning  the  payload,  because  in  typ¬ 
ical  tasks  many  different  payloads  are  encountered  and  it  is  unrealistic  to  assume  that 
the  properties  of  all  payloads  are  accurately  known.  Additionally,  this  requirement  limits 
the  modularity  and  portability  of  the  controllers,  so  that  implementation  of  these  schemes 
with  a  new  manipulator  can  involve  considerable  effort.  The  second  drawback  with  PD- 
plus-gravity  regulation  schemes  is  that  these  controllers  include  a  linear  state  feedback 
component  and  therefore  require  full  state  measurement,  which  can  be  problematic  in 
practice.  For  example,  although  manipulators  are  generally  equipped  with  high-precision 
sensors  capable  of  accurately  measuring  joint  position,  determining  the  joint  velocities  is 
more  challenging  and  typically  the  velocity  measurements  are  contaminated  with  noise  [2]. 
In  fact,  velocity  sensors  are  frequently  omitted  altogether  in  the  interest  of  saving  money, 
volume,  and  weight  in  manipulator  construction  [2],  so  that  in  many  applications  only 
position  information  is  available  for  feedback. 

Recognizing  these  difficulties,  several  researchers  have  studied  methods  of  addressing 
one  or  the  other  problem.  Studies  focusing  on  reducing  the  information  required  concerning 
the  manipulator  gravity  model  have  tended  to  concentrate  on  adaptive  control  approaches. 
For  instance,  the  schemes  presented  in  [3-5]  possess  a  PD-plus-gravity  structure  in  which 
the  (constant)  inertial  parameters  for  the  manipulator  and  payload  are  assumed  to  be  un¬ 
known,  and  this  uncertainty  is  compensated  adaptively.  Implementation  of  these  adaptive 
controllers  still  requires  knowledge  of  the  structure  of  the  gravity  model  and  measurements 
of  joint  velocity,  however.  Techniques  for  eliminating  the  need  for  velocity  measurements  in 
manipulator  controllers  have  been  proposed  in  [e.g.,  6-10].  One  approach  to  this  problem 
is  to  design  an  observer  for  estimating  the  velocity  utilizing  the  available  (high-quality) 
position  information,  and  then  to  use  this  estimated  velocity  in  place  of  the  actual  velocity 
in  the  state  feedback  component  of  the  control  law;  this  strategy  is  used  in  [6-8]  to  solve 
the  manipulator  tracking  problem.  Alternatively,  the  simple  structure  of  the  PD-plus- 
gravity  solution  to  the  manipulator  regulation  problem  suggests  a  correspondingly  simple 
solution  to  the  regulation  problem  when  only  position  measurements  are  available.  Thus 
elegant  control  laws  composed  of  a  gravity  compensation  term  and  a  linear  dynamic  (first 
order)  compensator  have  been  proposed  very  recently  in  [9,10].  It  is  noted  that  while 
these  schemes  eliminate  the  need  for  velocity  measurements,  they  still  require  knowledge 
of  the  gravity  model  for  the  manipulator  and$p,yload  to  provide  accurate  position  control. 


2 


Research  in  which  the  above  objectives  are  combined,  so  that  accurate  position  regulation 
is  achieved  without  gravity  model  information  or  velocity  measurements,  appears  to  be 
lacking;  preliminary  results  along  these  lines  are  reported  by  the  authors  in  [11].  An  addi¬ 
tional  observation  concerning  this  work  is  that  none  of  the  regulation  schemes  presented 
in  [3-10]  have  been  verified  through  a  comprehensive  simulation  and  experimental  study 
involving  industrial  robots. 

This  paper  introduces  an  adaptive  position  regulation  scheme  that  does  not  require 
joint  velocity  measurements  or  knowledge  of  the  gravity  model  for  the  manipulator  or  pay- 
load.  The  proposed  controller  is  general,  computationally  simple,  and  easy  to  implement. 
It  is  shown  that  the  adaptive  strategy  ensures  semiglobal  uniform  boundedness  of  all  sig¬ 
nals  in  the  presence  of  bounded  external  disturbances  and  possesses  desirable  convergence 
properties:  asymptotic  convergence  of  the  position  error  if  no  disturbances  are  present  and 
convergence  of  the  error  to  an  arbitrarily  small  neighborhood  of  zero  in  the  presence  of 
disturbances.  Moreover,  it  is  demonstrated  that  the  regulation  scheme  can  be  modified 
to  provide  accurate  tracking  control  through  the  introduction  of  additional  feedforward 
elements  in  the  control  law.  The  results  of  extensive  computer  simulations  and  hardware 
experiments  are  presented  to  illustrate  the  efficacy  of  the  proposed  approach. 

The  paper  is  organized  as  follows.  In  Section  2  some  preliminary  facts  of  relevance  to 
the  manipulator  motion  control  problem  are  established.  The  adaptive  regulation  scheme 
is  presented  and  analyzed  in  Section  3.  The  performance  of  the  controller  is  illustrated  in 
Section  4  through  a  computer  simulation  study  and  in  Section  5  through  an  experimental 
investigation.  Finally,  Section  6  summarizes  the  paper  and  draws  some  conclusions. 

2.  Preliminaries 

This  paper  considers  the  adaptive  position  regulation  problem  for  uncertain  manipu¬ 
lators  using  only  position  measurements.  Let  y  define  the  position  and  orientation  of  the 
robot  end-effector  relative  to  a  fixed  user-defined  reference  frame  and  note  that,  in  the 
most  general  case,  the  elements  of  y  are  local  coordinates  for  some  smooth  manifold  AA  of 
dimension  m.  The  forward  kinematic  and  differential  kinematic  maps  between  the  robot 
joint  coordinates  6  €  Af  and  the  end-effector  coordinates  y  can  be  written  as 

y=h(0),  y  =  J(6)8  (1) 

where  Af  is  a  smooth  manifold  of  dimension  n,  h  :  Af  — »  M,  and  J  €  £mXn  is  the 
end-effector  Jacobian  matrix. 

It  is  often  desirable  to  formulate  the  manipulator  control  problem  in  terms  of  general¬ 
ized  coordinates  other  than  the  joint  coordinates  6.  If  the  manipulator  is  nonredundant  (so 
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that  m  =  n)  and  is  in  a  region  of  the  workspace  where  J  has  full  rank,  then  the  elements 
of  y  are  also  generalized  coordinates  for  the  manipulator.  Observe  that  the  end-effector 
coordinates  y  are  typically  more  task-relevant  than  the  joint  coordinates  8 ,  so  that  de¬ 
veloping  the  controller  in  terms  of  y  can  lead  to  improved  performance,  efficiency,  and 
implement  ability.  A  task-space  formulation  can  also  be  realized  if  the  manipulator  is  kine¬ 
matically  redundant  (so  that  m  <  n)  by  utilizing,  for  example,  the  configuration  control 
approach  [e.g.,  12,13].  In  what  follows,  we  shall  consider  nonredundant  and  redundant 
robots  together  and  formulate  the  manipulator  control  problem  in  terms  of  a  set  of  n  gen¬ 
eralized  coordinates  x.  To  retain  generality,  we  shall  require  only  that  the  elements  of  x 
are  local  coordinates  for  some  smooth  manifold  X  of  dimension  n,  and  that  the  kinematic 
relationship  between  8  and  x  is  known  and  can  be  written  in  a  form  analogous  to  (1): 

x  =  h  a(8),  x  =  Ja(8)8  (2) 

where  ha  :  J\f  — >  X  and  Ja  G  3f?nX”.  Observe  that  for  x  to  be  a  valid  generalized  coordinate 
vector  the  elements  of  x  must  be  independent  in  the  region  of  interest;  thus  it  will  be 
assumed  in  our  development  that  Ja  is  of  full  rank.  Note  also  that  joint-space  control  is 
trivially  recovered  by  defining  ha  to  be  the  identity  map. 

Consider  the  manipulator  dynamic  model  written  in  terms  of  the  generalized  coordi¬ 
nates  x: 

F  =  .ff(x)x  +  Vcc(x,x)x  +  G(x)  +  d(t)  (3) 

where  F  G  is  the  generalized  force  associated  with  x,  H  €  5ft" Xn  is  the  manipulator 
inertia  matrix,  Vcc  G  9£nXn  quantifies  Coriolis  and  centripetal  acceleration  effects,  and  G  G 
is  the  vector  of  gravity  forces.  The  term  d  G  3?n  is  an  unknown  vector,  assumed  to  be 
bounded  with  bounded  first  derivative  with  respect  to  time,  that  can  represent  unmodelled 
effects  or  external  disturbances;  this  term  is  included  in  the  system  dynamics  to  permit 
the  robustness  of  the  proposed  regulation  scheme  to  be  studied.  It  is  well-known  that 
the  dynamics  (3)  possesses  considerable  structure.  For  example,  for  any  set  of  generalized 
coordinates  x,  the  matrix  H  is  bounded,  symmetric  and  positive-definite,  the  matrices  H 
and  14c  are  bounded  in  x,  depend  linearly  on  x,  and  are  related  according  to  H  =  Vcc  +  Vjc. 
The  vector  of  gravity  forces  G  is  the  gradient  of  a  potential  energy  function  and  is  bounded 
with  bounded  first  partial  derivatives.  Additionally,  I4c(x,x)y  =  14c(x,y)x  Vy,  and  if 
y  and  y  are  bounded  then  V'cc(x,y)  is  bounded  and  V'cc(x,y)  grows  linearly  with  x.  All 
of  these  properties  except  the  last  one  are  well-known  [e.g.,  14];  the  last  property  of  the 
Coriolis/ centripetal  matrix  Vcc  is  essentially  given  in  [15]  and  is  easy  to  see  by  expanding 
Vcc  in  terms  of  Christoffel  symbols.  2-68 
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The  control  system  proposed  in  this  paper  consists  of  two  subsystems:  an  adaptive 
scheme  that  provides  the  control  input  F  required  to  ensure  that  the  system  (3)  evolves 
from  its  initial  state  to  the  desired  final  state  x  =  x<*,  x  =  0,  and  an  algorithm  for 
mapping  the  control  input  F  to  a  physically  realizable  joint-space  control  torque  T  € 

If  x  is  chosen  as  described  above  then  the  required  mapping  between  F  and  T  is  simply 
T  =  JaT F.  Observe  that  the  desired  task-space  control  input  F  can  always  be  achieved  in 
this  way  because  it  has  been  assumed  that  Ja  is  of  full  rank  (this  is  the  standard  assumption 
in  task-space  controller  development).  It  is  noted  that  this  rank  assumption  could  be 
violated  if,  during  the  robot’s  motion,  a  kinematic  singularity  is  encountered;  in  this  event 
Ja  loses  rank  and  the  desired  control  input  F  may  or  may  not  be  realizable.  However, 
extensive  computer  simulations  and  hardware  experiments  have  indicated  that  task-space 
implementations  of  position  regulation  schemes  axe  quite  robust  to  kinematic  singularities, 
and  indeed  this  robustness  is  to  be  expected.  For  example,  under  position  regulation 
the  manipulator  evolves  to  the  goal  position  using  the  so  called  “natural  motion”  of  the 
system,  which  provides  considerable  robustness  to  singularities  [e.g.,  16,17].  Additionally, 
a  direct  task-space  implementation  eliminates  the  need  for  an  inverse  kinematic  map  and 
instead  achieves  the  necessary  transformation  from  task-space  to  joint-space  through  a 
mapping  of  generalized  forces  (i.e.,  by  mapping  F  to  T).  This  strategy  has  been  shown 
both  theoretically  and  in  practice  to  provide  significant  robustness  to  singularities  [e.g., 
12,13,17-20].  Thus,  for  the  remainder  of  the  paper,  we  make  the  usual  task-space  control 
assumption  that  the  control  input  F  can  be  commanded  as  desired  (through  the  equivalent 
joint  torque  vector  T  =  JaTF),  and  we  focus  in  our  subsequent  discussion  on  the  adaptive 
regulation  of  the  system  (3). 

3.  Adaptive  Motion  Control 

We  now  turn  to  the  development  of  an  adaptive  strategy  for  position  regulation  of 
uncertain  manipulators  which  requires  only  position  measurements.  Before  presenting  our 
main  results,  we  briefly  consider  the  nonadaptive  case  to  fix  some  ideas  and  introduce  a 
useful  Lyapunov  function. 

3.1  Nonadaptive  Case 

In  [9],  Berghuis  and  Nijmeijer  consider  the  problem  of  robot  regulation  using  only 
position  measurements  and  propose  the  following  elegant  solution  based  on  the  seminal 
work  of  Takegaki  and  Arimoto  [1]: 

F  =  G(x)  +  ky  72w  +  k2 72e  (4) 

2  -  6§ . 

w  =  — 2qw  +  7  e 
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where  e  =  x<*  —  x  is  the  position  regulation  error  (recall  that  Xd  is  constant),  fcj,  &2,7  axe 
positive  scalar  constants,  and  where  we  have  reformulated  the  scheme  given  in  [9]  slightly 
to  make  our  subsequent  analysis  more  convenient.  Observe  that  (4)  is  implementable 
without  velocity  information  because,  although  w  depends  on  e,  the  control  law  requires 
only  w  and  this  term  depends  only  on  e  (as  can  be  verified  through  direct  integration  of 
the  w  equation).  The  scheme  (4)  can  be  viewed  as  a  natural  extension  of  the  well-known 
PD-plus-gravity  control  law  proposed  in  [1],  with  the  velocity  feedback  component  of  the 
controller  in  [1]  replaced  by  a  first  order  linear  compensator.  In  fact,  it  will  be  seen  below 
that  w  provides  a  simple  and  effective  means  of  injecting  damping  into  the  closed-loop 
system  without  using  velocity  feedback.  The  reader  interested  in  additional  background, 
motivation,  and  implementation  details  concerning  the  position  regulation  scheme  (4)  is 
referred  to  the  paper  [9]. 

It  is  shown  in  [9]  that  (4)  globally  stabilizes  the  nominal  manipulator  dynamics  (3)  (i.e., 
the  dynamics  (3)  with  d  =  0)  at  the  equilibrium  e  =  e  =  w  =  0;  however,  the  stability 
analysis  in  [9]  employs  a  Lyapunov  function  with  a  negative-semidefinite  derivative,  so 
that  it  is  not  clear  how  to  extend  this  analysis  to  the  adaptive  case.  In  what  follows,  we 
present  an  alternative  stability  analysis  which  demonstrates  that  the  equilibrium  point  is 
exponentially  stable,  and  which  forms  the  basis  for  our  derivation  of  an  adaptive  solution 
to  the  problem. 

Lemma  1:  The  controller  (4)  exponentially  stabilizes  the  nominal  manipulator  (3)  (with 
d  =  0)  at  the  equilibrium  e  =  e  =  w  =  0  provided  7  is  chosen  sufficiently  laxge. 

Proof:  Applying  the  controller  (4)  to  the  nominal  manipulator  dynamics  (3)  yields  the 
closed-loop  system 


He  +  Vcce  +  &i72w  +  fc272e  =  0  (5) 

w  =  —2jw  +  72e 

Consider  the  Lyapunov  function  candidate 

V\  =  \eTHe  +  ifc272eTe  +  ifc1wTw  +  -^-eTHe  —  —wTHe  (6) 

2  2  2  K17  7 

and  note  that  V\  in  (6)  is  positive-definite  and  radially-unbounded  provided  7  is  chosen 
large  enough.  Differentiating  (6)  along  (5)  and  simplifying  using  the  structural  properties 
of  the  manipulator  dynamics  summarized  in  Section  2  yields 

Vi  =  -(7  -  j^-)eT He  -  ~ki7 1(  w  ||2  +2wT#e 

Ki  7  Ki 
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+  ^iT^v“e  “  F“wl 

~  ~  II  ®  II2  II  e  II2  -hy  II  w  ||2 

+  2Am„(ff)  ||  w  HU  e  ||  + ^  ||  e  mi  e  f  ||  w  ||||  e  ||2  (7) 

where  Amt-n(-),  Ama;c(-)  denote  the  minirmim  and  maximum  eigenvalue  of  the  matrix  ar¬ 
gument,  respectively,  and  kcc  is  a  scalar  upper  bound  on  the  linear  dependency  of  Vcc 
on  x  (i.e.,  |[  Vcc  ||F  <  kcc  ||  x  ||  Vx  with  ||  •  ||F  the  Frobenius  matrix  norm).  Define 
z  =  (llell  lie  ||  ||  w  ||]T  and 


Q 1  = 


o 

(7-&)WlT) 

A  max(H') 


0 

-A  max(H) 


and  note  that  Qj  is  positive-defimte  if  7  is  chosen  sufficiently  large.  The  following  bound 
on  V\  in  (7)  can  then  be  established: 


Vi  <  -zTQiZ  +  ^k^  II  e  nil  e  II2  II  w  ||||  e  ||2 

<-(W<3,)- Yyi'/2)IMI2  (8) 

for  some  scalar  constant  ati  which  does  not  increase  as  j  is  increased.  Let  c\{f)  = 
Amin(Qi)  —  and  choose  7  large  enough  so  that  ci(0)  >  0  (this  is  always  pos¬ 

sible).  The  inequality  (8)  then  implies  that  ci(0)  <  cx(t)  Vt,  so  that  Vx  <  -ci(0)  ||  z  ||2. 
Standard  arguments  can  then  be  used  to  show  that  e,  e,  w  converge  exponentially  to  zero 
[e.g.,  21].  D 


Observe  that  in  the  preceding  stability  analysis  a  negative-definite  Lyapunov  function 
derivative  has  been  obtained.  It  will  be  shown  in  what  follows  that,  as  a  consequence, 
this  analysis  can  be  readily  extended  to  the  adaptive  case.  Note  also  that  the  stability 
properties  established  in  Lemma  1  are  semiglobal  in  the  sense  that  the  region  of  attraction 
can  be  increased  arbitrarily  by  increasing  the  controller  gain  7.  It  is  stressed  that  this  does 
not  imply  that  7  must  be  chosen  to  be  overly  large;  indeed,  we  have  found  that  excellent 
performance  is  obtainable  with  this  gain  set  to  quite  modest  values. 


3.2  Adaptive  Regulation  Scheme 

As  indicated  above,  it  is  desirable  to  eliminate  the  need  for  information  concerning 
the  manipulator  gravity  model  in  position  regulation  strategies.  Consider  the  following 
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adaptive  approach  to  realizing  this  objective: 


F  =  i(t)  +  ki  72w  +  k2  72e 

w  =  — 27W  +  72e  (9) 

f  =  —erf  +  0(e  +  —  — w) 

^7  7 

where  f(t)  €  is  the  adaptive  element,  0  is  a  positive  scalar  constant,  and  o  is  defined 
as  follows:  c  =  0  if  ||  f  ||<  fmax  and  cr  =  a0  otherwise,  with  er0  and  fmax  positive  scalars. 
The  adaptive  element  f  is  intended  to  compensate  for  gravity  torques  G  and  external 
disturbances  d,  0  is  the  adaptation  gain  for  f,  and  a  is  defined  to  provide  a  “switching 
^-modification”  adaptation  law  [22].  This  approach  to  adaptive  control  has  been  proposed 
for  linear  systems  to  provide  robust  performance  in  the  presence  of  external  disturbances 
while  retaining  asymptotic  convergence  of  the  system  error  in  the  ideal  case  in  which  no 
disturbances  are  present  [22].  In  what  follows,  it  will  be  shown  that  the  formulation  (9) 
yields  a  similar  robust  adaptive  performance  for  the  problem  of  regulating  the  nonlinear 
manipulator  system  (3).  Observe  that  (9)  is  implementable  without  velocity  information 
because,  although  w  and  f  depend  on  e,  the  control  law  terms  w  and  f  depend  only  on  e 
(as  can  be  verified  through  direct  integration  of  the  w  and  f  equations). 

The  stability  properties  of  the  proposed  adaptive  regulation  strategy  (9)  are  summa¬ 
rized  in  the  following  theorem. 

Theorem  1:  If  7  is  chosen  sufficiently  large  then  the  adaptive  scheme  (9)  ensures  that 
(3)  evolves  with  all  signals  (semiglobally)  uniformly  bounded  and  so  that: 

1.  In  the  presence  of  external  disturbances,  e  and  e  converge  asymptotically  to  a  compact 
set  which  can  be  made  arbitrarily  small  (by  increasing  0). 

2.  In  the  absence  of  external  disturbances,  e  and  e  converge  asymptotically  to  zero. 

Proof:  The  proof  is  given  in  Appendix  A.  □ 

Several  observations  can  be  made  concerning  the  adaptive  position  regulation  strategy 
(9).  First,  the  stability  analysis  presented  in  Appendix  A  demonstrates  that  this  scheme 
ensures  that  the  manipulator  dynamics  (3)  evolves  from  any  initial  state  to  any  desired 
final  configuration  with  uniform  boundedness  of  all  signals  and  with  desirable  convergence 
properties:  asymptotic  convergence  to  zero  if  no  external  disturbances  are  present  and 
convergence  to  an  arbitrarily  small  neighborhood  of  zero  in  the  presence  of  bounded  dis¬ 
turbances.  Note  that,  in  the  presence  of  disturbances,  the  ultimate  size  of  the  regulation 
error  can  be  reduced  as  desired  simply  by  increasing  a  single  controller  parameter  (the 
adaptation  gain  0).  The  controller  is  ve?y7^imple,  utilizes  only  measurements  of  joint 
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position,  and  does  not  require  knowledge  of  the  structure  or  parameter  values  for  the  ma¬ 
nipulator  model  or  the  payload.  Thus  the  proposed  scheme  provides  a  computationally 
efficient,  modular,  readily  implementable,  and  robust  approach  to  manipulator  position 
regulation  in  task-space  or  joint-space.  An  additional  observation  is  that  the  scalar  con¬ 
troller  gains  ki,  &2,  7,  and  (3  can  be  replaced  with  the  appropriate  diagonal  matrices  for 
increased  flexibility  and  performance;  scalar  gains  axe  used  above  for  simplicity  of  devel¬ 
opment. 

Finally,  it  is  noted  that  robust  and  accurate  position  regulation  with  good  transient 
performance  is  obtainable  with  a  wide  range  of  values  for  the  controller  parameters  k\, 
A;2,  7,  /?,  ooj  and  /mai.  This  may  be  inferred  from  an  examination  of  the  stability  proof 
in  Appendix  A,  where  the  role  of  these  parameters  in  the  closed-loop  system  evolution 
is  made  explicit,  and  is  clearly  demonstrated  in  the  computer  simulations  and  hardware 
experiments  presented  later  in  the  paper.  As  a  consequence,  we  have  found  that  it  is 
easy  to  choose  values  for  these  parameters  which  provide  good  performance  for  a  given 
robot  manipulator;  indeed,  we  have  found  that  a  few  iterations  is  sufficient  to  obtain  such 
parameter  values.  Of  course,  it  is  useful  to  present  some  guidelines  for  use  in  selecting  these 
controller  parameters.  Such  guidelines  may  be  obtained  from  a  study  of  the  stability  proof 
in  Appendix  A  and  an  inspection  of  the  controller  structure  in  (9).  For  example,  reasonable 
values  for  k\ ,  k%,  and  7  can  be  obtained  by  setting  &2  =  2 ki  and  choosing  7  so  that  &272  and 
kij2  are  comparable  to  standard  “PD”  gains  for  the  manipulator,  respectively  (there  are 
well-known  techniques  for  obtaining  such  values).  The  controller  parameter  fmax  should 
be  chosen  so  that  ||  G(x)  +  d(i)  ||<  fmax]  this  is  easily  achieved  since  fmax  can  be  chosen 
so  that  this  inequality  is  satisfied  for  a  very  conservative  upper  bound  on  ||  G(x)  4-  d(t)  ||. 
Finally,  the  adaptation  gain  <tq  should  be  chosen  relatively  small  (compared  with  /?),  while 
the  adaptation  gain  /?  can  be  increased  until  the  desired  positioning  accuracy  is  obtained. 
Of  course,  /?  should  not  be  set  to  an  excessively  large  value  since  this  can  cause  difficulties 
in  digital  implementation  of  the  adaptive  law.  Additional  details  concerning  the  selection 
of  the  controller  parameters  are  provided  in  the  Sections  4  and  5,  where  a  description  of 
the  results  of  the  computer  simulations  and  hardware  experiments  is  given. 

3.3  Extension  to  Tracking  Control 

It  is  interesting  to  note  that  the  proposed  approach  to  controller  development  is  readily 
applicable  to  the  robot  trajectory  tracking  control  problem,  where  the  objective  is  to  ensure 
that  the  manipulator  (3)  evolves  from  its  initial  state  to  the  desired  final  state  along  some 
specified  trajectory  Xd(t)  €  A  (where  X4  is  bounded  with  bounded  derivatives).  Again,  our 
goal  is  to  obtain  a  control  scheme  which  dpe^„not  require  joint  velocity  measurements  or 


knowledge  of  the  manipulator  dynamic  model  for  implementation.  Consider  the  adaptive 
tracking  controller  obtained  from  (9)  through  the  introduction  of  feedforward  elements  in 
the  control  law: 


F  =  A(t)xd  +  B(t)±i  +  f(t)  +  kiy2w  +  fc272e 

w  =  -2yw  +  72e  (10) 

where  A(t)  €  &nX",  B(t)  6  3£nXn  are  (feedforward)  adaptive  gains  which  axe  adjusted 
according  to  the  following  simple  update  laws  (the  adaptation  law  for  f(<)  is  repeated  here 
for  convenience  of  reference): 

f  =  -ax  f  +  ftq 

A  =  -a2A  +  #>  qxj  (11) 

B  =  -azB  +  ^3qxJ 

where  q  =  e  +  k2e/k1j  -  w/7  represents  a  weighted  and  filtered  error  term  and  the  a, 
and  /?,•  are  positive  scalar  adaptation  gains.  The  feedforward  terms  A(t)xd  and  B(t)xd 
which  have  been  added  to  the  regulation  scheme  (9)  to  obtain  the  tracking  controller  (10) 
are  intended  to  compensate  for  the  dynamic  model  terms  Hx  and  Vccx,  respectively;  this 
compensation  is  necessary  if  accurate  tracking  control  is  to  be  achieved  in  an  efficient 
manner.  Observe  that  the  scheme  (10),(11)  is  implementable  without  velocity  information 
because,  although  w,  f,  A,  and  B  depend  on  e,  the  control  law  requires  the  terms  w,  f, 
A,  and  B  and  these  can  be  integrated  so  as  to  depend  only  on  e.  f 

The  stability  properties  of  the  proposed  adaptive  tracking  controller  (10), (11)  axe 
summarized  in  the  following  theorem. 

Theorem  2:  The  adaptive  controller  (10), (11)  ensures  that  e,  e,  w,  f ,  A,  B  are  semiglobally 
uniformly  bounded  provided  7  is  chosen  sufficiently  large.  Moreover,  the  state  trajectory 
tracking  error  e,  e  is  guaranteed  to  converge  exponentially  to  a  compact  set  which  can  be 
made  arbitrarily  small. 

Proof:  The  proof  is  given  in  Appendix  B.  □ 


Several  observations  can  be  made  concerning  the  adaptive  trajectory  tracking  strategy 
(10), (11).  First  notice  that,  although  the  stability  analysis  presented  in  Appendix  B  is 

f  To  see  this  in  the  case  of  A  and  B,  note  that  terms  of  the  form  exj  and  exj  can  be 
integrated  as  follows: 


f 


ex  J  dt  =  ex  J 


which  eliminates  the  dependence  on  e. 
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somewhat  complicated  and  involves  complex  expressions,  the  proposed  control  law  is  very 
simple,  utilizes  only  measurements  of  joint  position,  and  requires  virtually  no  informa¬ 
tion  concerning  the  manipulator  dynamics.  Thus  the  proposed  control  scheme  provides  a 
computationally  efficient,  modular,  and  readily  implementable  approach  to  manipulator 
trajectory  tracking  in  task-space  or  joint-space.  It  is  shown  that  the  controller  ensures 
uniform  boundedness  of  all  signals  in  the  presence  of  bounded  disturbances,  and  that  the 
ultimate  bounds  on  the  trajectory  tracking  errors  can  be  reduced  arbitrarily  by  increas¬ 
ing  the  adaptation  gains  fa.  Notice  also  that  exponential  convergence  ensures  that  the 
transient  behavior  of  the  tracking  errors  will  be  well-behaved.  An  additional  observation 
is  that  the  scalar  controller  gains  ku  k2,  7  and  the  adaptation  gains  and  fa  can  be 
replaced  with  the  appropriate  diagonal  matrices  for  increased  flexibility  and  performance; 
scalar  gains  are  used  above  for  simplicity  of  development.  Note  that,  as  was  the  case  with 
the  position  regulation  strategy  (9),  robust  and  accurate  trajectory  tracking  is  obtainable 
using  (10), (11)  with  a  wide  range  of  values  for  the  controller  parameters.  As  a  conse¬ 
quence,  it  is  easy  to  obtain  parameter  values  which  provide  good  performance  for  a  given 
robot  manipulator,  and  it  is  mentioned  that  the  guidelines  provided  for  selecting  controller 
parameters  for  the  adaptive  regulation  scheme  (9)  are  applicable  here  as  well. 

4.  Simulation  Results 

The  proposed  adaptive  position  regulation  strategy  (9)  and  trajectory  tracking  scheme 
(10), (11)  are  now  applied  to  two  industrial  robots  through  computer  simulation.  The 
robots  chosen  for  this  simulation  study  are  the  six  degree-of-freedom  (DOF)  PUMA  560 
manipulator  and  the  six  DOF  IMI  Zebra  Zero  robot.  The  simulation  environment  in¬ 
corporates  models  of  all  important  dynamic  subsystems  and  phenomena,  such  as  the  full 
nonlinear  arm  dynamics,  joint  stiction,  sensor  noise,  and  transmission  effects,  and  we  there¬ 
fore  believe  that  the  environment  provides  the  basis  for  a  realistic  evaluation  of  controller 
performance  [23].  One  indication  that  this  simulation  environment  does,  indeed,  provide 
the  desired  realism  is  the  close  agreement  between  the  simulated  response  of  the  Zebra 
manipulator  and  the  actual  response  of  this  robot  observed  in  the  hardware  experiments 
presented  in  the  following  Section  Additional  verification  of  the  fidelity  of  the  simulation 
environment  is  presented  in  [11,13,20,23].  For  all  simulations  the  control  laws  are  applied 
to  the  manipulator  model  with  a  sampling  period  of  two  milliseconds,  and  all  integra¬ 
tions  required  by  the  controller  are  implemented  using  a  simple  trapezoidal  integration 
rule  with  a  time-step  of  two  milliseconds.  It  is  noted  that  this  simple  method  of  numer¬ 
ical  integration  is  commonly  used  in  digital  implementation  of  adaptive  control  laws  and 
has  proven  to  be  quite  effective  for  these  plications.  Both  the  PUMA  560  and  Zebra 
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Zero  manipulators  possess  conventional  designs  with  six  revolute  joints  configured  in  a 
“waist-shoulder-elbow- wrist”  arrangement  (see  Figure  1).  However,  the  two  manipulators 
are  very  different  in  size:  the  PUMA  560  robot  is  a  conventional  industrial  arm  while  the 
Zebra  Zero  robot  is  a  much  smaller  laboratory  arm  (for  example,  the  total  reach  of  the 
Zebra  Zero  is  less  than  30in).  It  is  noted  that  throughout  the  computer  simulation  study, 
the  unit  of  length  is  inch,  the  unit  of  time  is  second,  the  unit  of  angle  is  degree,  and  the 
unit  of  force  is  pound,  unless  stated  otherwise. 

The  first  set  of  simulations  demonstrates  the  capability  of  the  proposed  adaptive  regu¬ 
lation  scheme  (9)  to  provide  accurate  task-space  position  regulation.  In  the  first  simulation, 
the  PUMA  robot  is  initially  at  rest  with  joint-space  position  0(0)  =  [0, 90, 180, 0,  —90, 0]T. 
The  robot  is  commanded  to  move  its  end-effector  lOin  in  the  x-direction,  30in  in  the  y- 
direction,  and  20in  in  the  ^-direction,  where  all  motions  are  specified  relative  to  the  base 
coordinate  frame.  Additionally,  the  robot  is  required  to  change  its  end-effector  orientation 
from  an  initial  orientation  matrix  Rinit  to  a  final  orientation  matrix  RfinaU  where 
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and  where  each  rotation  matrix  R  is  specified  relative  to  the  base  frame  of  the  robot 
[14].  Note  that,  alternatively,  the  required  orientation  change  can  be  specified  using  the 
familiar  “equivalent-angle-axis”  parameterization  of  orientation  given  by  ipk  €  Sfc3 ,  where 
k  is  a  unit  vector  aligned  with  the  axis  of  rotation  from  the  initial  to  the  final  desired 
orientation  and  ip  is  the  angle  between  these  orientations  measured  about  this  axis;  using 
this  representation  the  desired  orientation  change  is  t /><£  =  150°  and  =  [0, 0, 1]  .  Thus 
the  PUMA  robot  is  commanded  to  translate  its  end-effector  approximately  37in  and  rotate 
its  end-effector  through  150°. 

The  proposed  adaptive  regulation  strategy  (9)  is  utilized  to  achieve  the  specified  po¬ 
sition  control.  To  provide  a  basis  for  comparison,  a  nonadaptive  controller  and  a  classical 
PD  control  law  axe  also  used  to  provide  this  motion  control.  The  nonadaptive  regulation 
scheme  is  obtained  by  removing  the  adaptive  term  f  from  (9) 

F  =  kjj2w  +  k2 72e  (12) 

w  =  — 2yw  +  y2e 

while  the  PD  controller  is  obtained  from  (12)  by  assuming  that  velocity  information  is 
available: 

F  =  ktfl6+  fc272e  (13) 
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In  this  simulation,  perfect  (noise-free)  velocity  information  is  provided  to  the  PD  strategy 
(13).  For  all  strategies,  the  controller  parameters  ki,k2,y  are  set  as  follows:  k\  =  10, 
k2  =  20,  7  =  5.  In  (9),  the  adaptive  gain  f  is  set  to  zero  initially  and  the  adaptation 
parameters  are  set  as  follows:  <r0  =  0.1,  /?  =  100,  and  fmax  —  500.  These  gains  were 
selected  using  the  guidelines  given  in  Section  3,  and  no  attempt  was  made  to  “tune”  the 
gains  to  obtain  the  best  possible  performance.  Indeed,  we  found  that  excellent  performance 
could  be  obtained  with  (9)  by  using  a  wide  range  of  parameter  values.  For  example,  it 
was  found  that  given  any  reasonable  values  (in  the  sense  of  the  guidelines  given  in  Section 
3)  for  the  parameters  fci,  k2,  and  7,  virtually  any  desired  accuracy  could  be  obtained 
simply  by  increasing  the  adaptation  gain  /?.  The  results  of  this  simulation  axe  given  in 
Figures  2a  and  2b,  with  Figure  2a  showing  the  response  of  the  vertical  position  error 
for  the  end-effector  and  Figure  2b  depicting  the  evolution  of  the  orientation  error  for 
the  end-effector  for  the  three  controllers  (9),  (12),  and  (13).  The  response  for  the  other 
two  end-effector  position  error  components  were  quite  similar  and  hence  axe  not  shown. 
Observe  that  accurate  position  regulation  is  achieved  with  the  adaptive  scheme  (9).  More 
specifically,  the  steady-state  vertical  position  error  and  orientation  error  axe  less  than 
0.002in  and  0.01°,  respectively.  As  expected,  implementation  of  the  nonadaptive  scheme 
(12)  results  in  significantly  degraded  performance  compared  with  the  adaptive  strategy  (9): 
the  steady-state  vertical  position  and  orientation  errors  axe  on  the  order  of  3.3in  and  5.3°, 
respectively,  for  the  nonadaptive  controller  (12).  This  decrease  in  performance  is  most 
substantial  for  the  position  portion  of  the  regulation  task  because  gravity  effects  axe  most 
pronounced  for  positioning.  The  PD  controller  (13)  performs  somewhat  better  than  the 
nonadaptive  strategy  (12)  because  it  has  access  to  full  state  information;  for  this  controller 
the  steady-state  vertical  position  and  orientation  errors  axe  on  the  order  of  2.3in  and  1.1°, 
respectively.  Notice  that  the  steady-state  position  error  is  still  quite  large  because  the 
simple  PD  controller  (13)  has  no  direct  means  of  compensating  for  the  effects  of  gravity 
loads. 

The  next  simulation  demonstrates  the  flexibility  and  modularity  of  the  proposed  adap¬ 
tive  regulation  strategy  (9).  In  this  simulation,  the  Zebra  Zero  robot  is  initially  at  rest 
with  joint-space  position  0(0)  =  [—45°,  0°,  —100°,  0°, 45°,  0°]T.  The  robot  is  commanded 
to  move  its  end-effector  lOin  in  the  x-direction,  lOin  in  the  ^-direction,  and  lOin  in  the 
^-direction,  where  all  motions  are  specified  relative  to  the  base  coordinate  frame.  Addition¬ 
ally,  the  robot  is  required  to  change  its  end-effector  orientation  from  an  initial  orientation 
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matrix  Rinit  to  a  final  orientation  matrix  R final,  where 

'i  o  ol  [i  o  o  ■ 

Rinit  —  0  1  0  ,  R final  =  0  0  — 1 

0  0  lj  [0  10 

and  where  each  rotation  matrix  R  is  specified  relative  to  the  base  frame  of  the  robot  [14]. 
The  equivalent-angle-axis  parameterization  of  this  desired  orientation  change  is  rpd  =  90° 
and  k<2  =  [1,0,0]  .  Thus  the  Zebra  robot  is  commanded  to  translate  its  end-effector 
approximately  17.3in  and  rotate  its  end-effector  through  90°. 

The  proposed  adaptive  regulation  strategy  (9)  is  utilized  to  achieve  the  specified  posi¬ 
tion  control.  In  this  simulation,  the  controller  parameters  k1,k2,  7,  f(0),  cr0,  /?,  and  fmax 
axe  set  to  the  same  values  used  in  the  previous  simulation  with  the  PUMA  560  robot,  even 
though  the  PUMA  arm  is  considerably  larger  and  more  massive  than  the  Zebra.  Note  that 
this  provides  a  means  of  examining  the  flexibility  and  modularity  of  the  control  scheme  (9), 
and  also  demonstrates  that  the  performance  of  the  adaptive  controller  is  quite  insensitive 
to  the  particular  choice  of  controller  parameters. 

The  results  of  this  simulation  are  given  in  Figures  3a  and  3b,  with  Figure  3a  showing 
the  response  of  the  (norm  of  the)  position  error  for  the  end-effector  and  Figure  3b  depicting 
the  evolution  of  the  orientation  error  for  the  end-effector.  Observe  that  accurate  position 
regulation  is  achieved  with  the  adaptive  scheme  (9)  despite  the  fact  that  the  controller 
parameters  are  clearly  not  “timed”  to  provide  good  performance.  In  this  simulation,  the 
steady-state  position  and  orientation  errors  are  less  than  O.OOlin  and  0.01°,  respectively. 

The  last  set  of  simulations  demonstrates  the  capability  of  the  control  scheme  (10), (11) 
to  provide  accurate  task-space  trajectory  tracking.  This  portion  of  the  simulation  study 
consists  of  two  parts:  a  nominal  performance  study  in  which  no  external  disturbances  are 
applied  to  the  manipulator  dynamics,  and  a  robustness  study  where  initial  tracking  errors 
and  a  payload-related  disturbance  are  introduced.  First  we  examine  the  capability  of  the 
control  scheme  (10), (11)  to  provide  accurate  task-space  trajectory  tracking  under  nominal 
conditions,  in  which  no  external  disturbances  are  present.  The  PUMA  robot  is  initially 
at  rest  with  joint-space  position  0(0)  =  [0,90, 180,0,  -90, 0]T.  The  robot  is  commanded 
to  smoothly  cycle  its  end-effector  through  a  sinusoidal  trajectory  with  amplitude  20in  in 
the  r-direction,  20in  in  the  y-direction,  and  20in  in  the  ^-direction  for  6  seconds  according 
to  the  following  temporal  trajectories:  xd(t)  =  +  10(1  -  cos(ir/2)t),yd(t)  =  y{  +  10(1  — 

cos(tt/2 )t),zd(t)  =  Zi  +  10(1  -  cos(7r/2)<),  for  t  €  [0,6],  where  xi,yi,Z{  are  the  initial 
position  coordinates  of  the  end-effector  and  all  motions  are  specified  relative  to  the  base 
coordinate  frame  (see  Figure  1).  Additionally,  the  robot  is  required  to  smoothly  cycle 
its  end-effector  orientation  from  an  initial  orientation  matrix  Rinit  to  a  final  orientation 
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and  where  each  rotation  matrix  R  is  specified  relative  to  the  base  frame  of  the  robot. 
The  desired  temporal  trajectory  for  this  orientation  change  is  specified  as  follows:  ifid  = 
75°(1  —  cos(7r/2 )t)  and  k<f  =  [0,0, 1]T  for  t  €  [0,6]  (see  Figure  1).  Thus  the  robot  is 
commanded  to  rapidly  translate  and  rotate  its  end-effector  through  a  sinusoidal  trajectory 
with  a  translational  amplitude  of  approximately  35in  and  a  rotational  amplitude  of  150° 
for  6  seconds.  The  control  strategy  (10), (11)  is  utilized  to  achieve  the  specified  trajectory 
tracking.  The  adaptive  gains  f ,  A,  B  are  set  to  zero  initially,  while  the  remaining  controller 
parameters  are  set  as  follows:  kj  =  10,  &2  =  20,  7  =  5,  and  a =  0.01,  fli  =  100  for 
i  =  1,2,3.  Note  that  all  of  the  controller  parameters  in  the  tracking  strategy  (10), (11) 
which  also  appear  in  the  regulation  scheme  (9)  are  set  to  the  values  used  in  the  regulation 
experiment  summarized  above,  illustrating  the  modularity  and  flexibility  inherent  in  this 
approach  to  manipulator  motion  control. 

The  results  of  this  simulation  are  given  in  Figures  4a-4c,  and  indicate  that  accurate 
task-space  trajectory  tracking  is  achieved.  For  example,  the  maximum  translational  and 
rotational  tracking  errors  are  less  than  0.05in  and  0.7°,  respectively,  for  this  simulation.  To 
illustrate  the  effects  of  adaptation,  the  same  trajectory  was  tracked  with  the  nonadaptive 
controller  obtained  by  setting  the  adaptation  gains  /?,-  to  zero  in  (10), (11).  The  performance 
of  this  nonadaptive  scheme  is  also  shown  in  Figures  4a-4c,  and  a  comparison  of  the  plots 
makes  clear  the  improvement  in  tracking  obtainable  through  the  use  of  adaptation. 

The  next  simulation  in  this  study  investigates  the  robustness  of  the  controller  (10), (11) 
to  external  disturbances.  More  specifically,  the  effects  of  an  abrupt  change  in  payload  mass 
and  of  initial  positioning  errors  are  examined.  To  study  the  effect  of  these  disturbances, 
the  controller  (10), (11)  is  utilized  to  repeat  the  task  specified  in  the  previous  simulation, 
but  here  a  5.0in  position  error  is  introduced  in  each  translational  DOF,  and  the  payload  is 
abruptly  reduced  from  501b  to  01b  at  the  midpoint  of  the  trajectory.  In  this  simulation  all 
of  the  controller  parameters  are  set  to  the  values  used  in  the  nominal  study.  The  results 
of  this  simulation  are  given  in  Figures  5a  and  5b,  and  indicate  that  accurate  task-space 
trajectory  tracking  is  achieved  despite  the  initial  position  errors  and  the  abrupt  change  in 
payload  mass. 
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5.  Experimental  Results 

The  adaptive  position  regulation  strategy  (9)  and  trajectory  tracking  scheme  (10), (11) 
axe  now  applied  to  an  industrial  robot  in  an  experimental  investigation.  The  experimental 
facility  utilized  for  this  study  is  the  New  Mexico  State  University  Robotics  Laboratory  and 
consists  of  a  IMI  Zebra  Zero  robot  arm  and  the  associated  control  computer  (see  Figure 
1).  The  Zebra  robot  possesses  a  conventional  design  with  six  revolute  joints  configured  in 
a  “waist-shoulder-elbow-wrist”  arrangement  (see  Figure  1).  All  control  software  is  written 
in  ‘C’  and  is  hosted  on  an  IBM-compatible  486  personal  computer.  A  complete  description 
of  this  flexible  and  modular  experimental  testbed  is  beyond  the  scope  of  this  paper;  the 
interested  reader  is  referred  to  [23]  for  such  a  description.  It  is  noted,  however,  that 
the  open  architecture  environment  is  designed  to  allow  convenient  implementation  and 
experimental  evaluation  of  advanced  control  schemes  using  the  Zebra  robot,  and  therefore 
provides  an  ideal  testbed  for  the  present  investigation.  Throughout  the  study  the  unit  of 
time  is  second  and  the  unit  of  angle  is  degree. 

In  this  set  of  experiments,  the  Zebra  manipulator  is  controlled  in  joint-space.  In  the 
first  experiment,  the  Zebra  manipulator  is  initially  at  rest  with  joint-space  position  0(0)  = 
[—45°,  0°,  —100° ,  0° ,  45°,  0°]  ,  and  is  commanded  to  move  to  the  desired  final  configuration 
Q&  —  [45°,  90°,  —100°,  0°,  45°,  0°]T.  Thus  the  waist  and  shoulder  joints  are  each  requested 
to  move  through  90°  and  to  initiate  and  terminate  the  motion  with  zero  velocity,  while 
the  remaining  joints  are  commanded  to  remain  in  their  initial  positions.  This  position 
control  task  is  performed  using  the  adaptive  regulation  scheme  (9)  and,  to  provide  a  basis 
for  comparison,  the  task  is  also  performed  using  the  IMI  controller  which  is  provided  with 
the  Zebra  manipulator.  To  illustrate  the  effect  that  payload  variation  has  on  controller 
performance,  the  desired  regulation  task  is  repeated  for  two  payloads:;  one  of  01b  and  one  of 
21b  (recall  that  the  Zebra  manipulator  is  a  small  laboratory  robot,  so  that  a  21b  payload  is  a 
significant  load).  The  adaptive  control  strategy  (9)  is  implemented  with  the  adaptive  gain  f 
initialized  to  zero  and  the  remaining  controller  parameters  set  as  follows:  ki  =  10,  k2  =  20, 
7  =  5,  <To  =  0.1,  /?  =  100,  fmax  —  500.  These  gains  were  selected  using  the  guidelines 
given  in  Section  3,  and  no  attempt  was  made  to  “time”  the  gains  to  obtain  the  best 
possible  performance;  indeed,  we  found  that  a  wide  range  of  parameter  values  provided 
similar  performance.  The  adaptive  controller  is  applied  to  the  robot  with  a  sampling 
period  of  seven  milliseconds,  and  all  integrations  required  by  the  controller  are  implemented 
using  a  simple  trapezoidal  integration  rule  with  a  time-step  of  seven  milliseconds.  It  is 
mentioned  that  this  relatively  slow  sampling  rate  is  the  fastest  one  available  in  the  present 
experimental  setup,  and  does  not  represent  a  limitation  resulting  from  the  computational 
requirements  of  the  adaptive  controller;  iigleggl,  the  adaptive  scheme  (9)  can  be  run  at  a 


16 


much  faster  rate  using  the  computing  equipment  presently  in  place.  Note  that  the  IMI 
controller  operates  with  a  sampling  period  of  one  millisecond,  and  therefore  possesses  a 
considerable  advantage  in  this  experiment. 

The  results  of  this  set  of  experiments  axe  given  in  Figures  6a  and  6b  (01b  payload)  and 
Figures  7a  and  7b  (21b  payload).  It  can  be  seen  from  Figures  6a  and  6b  that  the  adaptive 
scheme  (9)  and  IMI  controller  provide  comparable  performance  if  there  is  no  payload 
present,  despite  the  fact  that  the  IMI  control  loop  is  seven  times  faster  than  the  adaptive 
control  loop.  The  steady-state  error  for  both  the  waist  joint  (Figure  6a)  and  shoulder  joint 
(Figure  6b)  are  approximately  0.1°  for  the  adaptive  scheme  and  0.2°  for  the  IMI  controller. 
An  examination  of  Figures  7a  and  7b  reveals  that  the  adaptive  controller  provides  virtually 
identical  performance  despite  the  presence  of  the  21b  payload,  while  the  accuracy  of  the  IMI 
controller  degrades  noticeably.  As  expected,  the  effect  of  the  payload  is  most  evident  with 
the  shoulder  joint,  since  it  is  this  joint  which  must  move  against  gravity  torques  (see  Figure 
lb).  More  specifically,  in  the  presence  of  the  payload  the  shoulder  joint  steady-state  error 
for  the  adaptive  scheme  (9)  increases  slightly  to  approximately  0.2°,  while  with  the  IMI 
controller  the  steady-state  error  for  this  joint  increases  significantly  to  approximately  3.0°. 
This  experiment  provides  an  illustration  of  one  of  the  advantages  of  the  adaptive  approach 
to  manipulator  control:  uniform  performance  in  the  presence  of  variation  in  payload. 

In  the  next  experiment,  the  adaptive  trajectory  tracking  scheme  (10), (11)  is  applied 
to  the  problem  of  tracking  a  user-specified  trajectory  for  the  manipulator  joint  angles. 
The  Zebra  manipulator  is  initially  at  rest  with  the  same  joint-space  position  0(0)  as  in 
the  regulation  experiments.  The  waist  joint  is  commanded  to  smoothly  move  from  its 
initial  position  of  0i  =  —45°  to  an  intermediate  position  of  Q\  =  45°,  and  then  to  return 
to  its  initial  position;  the  desired  temporal  trajectory  for  this  motion  is  specified  to  be 
01  d  =  — 45°  +  45° (1  —  cos(x/2)t)  for  t  €  [0,4].  At  the  same  time,  the  shoulder  joint  is 
commanded  to  move  from  02  =  0°  to  02  =  60°  and  then  back  to  02  =  0°  according  to 
the  trajectory  02<f  =  30°(1  —  cos(7r/2)t)  for  t  €  [0,4].  All  of  the  other  manipulator  joints 
are  commanded  to  remain  in  their  initial  positions.  Thus  the  manipulator  is  required  to 
track  a  large  and  rapid  joint-space  trajectory  with  no  a  priori  information  concerning  the 
system  model.  The  control  strategy  (10), (11)  is  utilized  to  achieve  the  specified  trajectory 
tracking.  The  adaptive  gains  f ,  A ,  B  axe  set  to  zero  initially,  while  the  remaining  controller 
parameters  axe  set  as  follows:  Aq  =  10,  k2  =  20,  7  =  5,  and  =  0,  /?;  =  100  for 
*  =  1,2,3.  Note  that  all  of  the  controller  parameters  in  the  tracking  strategy  (10), (11) 
which  also  appear  in  the  regulation  scheme  (9)  axe  set  to  the  values  used  in  the  regulation 
experiment  summarized  above,  illustrating  the  modularity  inherent  in  this  approach  to 
manipulator  motion  control.  The  adaptive  -c&htroller  is  again  applied  to  the  robot  with 


17 


a  sampling  period  of  seven  milliseconds,  and  all  integrations  required  by  the  controller 
are  implemented  using  a  simple  trapezoidal  integration  rule  with  a  time-step  of  seven 
milliseconds. 

The  results  of  this  experiment  are  given  in  Figures  8a  and  8b.  During  the  arm  motion, 
the  joint  encoder  counts  are  recorded  directly  from  the  arm,  converted  into  degrees,  and 
stored  in  a  data  file.  The  maximum  tracking  errors  incurred  at  the  waist  and  shoulder 
joints  are  less  than  0.8°  and  0.7°,  respectively.  Figures  8a  and  8b  show  the  desired  and 
actual  trajectories  of  the  waist  and  shoulder  joints,  respectively,  and  confirm  that  accurate 
joint-space  trajectory  tracking  is  achieved. 

6.  Conclusions 

This  paper  has  introduced  a  new  approach  to  adaptive  motion  control  of  uncertain 
robot  manipulators.  The  primary  contributions  of  the  paper  are  as  follows: 

1.  The  position  regulation  problem  has  been  studied,  and  an  adaptive  solution  to  this 
problem  has  been  proposed  which  does  not  require  joint  velocity  measurements  or 
knowledge  of  the  gravity  model  for  the  manipulator  or  payload.  To  the  best  of  our 
knowledge,  this  is  the  first  control  scheme  capable  of  compensating  for  both  of  these 
sources  of  uncertainty  (other  than  the  preliminary  result  presented  by  the  authors  in 
[11]).  The  proposed  controller  is  simple,  easy  to  implement,  ensures  uniform  bound¬ 
edness  of  all  signals  in  the  presence  of  bounded  external  disturbances,  and  possess  de¬ 
sirable  convergence  properties:  asymptotic  convergence  if  no  disturbances  axe  present 
and  convergence  to  an  arbitrarily  small  neighborhood  of  zero  in  the  presence  of  dis¬ 
turbances.  It  is  noted  that  even  the  state  feedback  version  of  this  result  (i.e.,  the 
controller  obtained  from  (9)  by  replacing  w  with  e  in  the  case  that  velocity  is  accu¬ 
rately  measurable)  represents  a  new  contribution. 

2.  It  has  been  demonstrated  that  the  regulation  scheme  can  be  modified  to  provide  ac¬ 
curate  tracking  control  through  the  introduction  of  additional  feedforward  elements  in 
the  control  law.  The  resulting  tracking  strategy  is  very  general  and  computationally 
efficient,  requires  virtually  no  information  regarding  the  manipulator  dynamic  model, 
is  implementable  without  velocity  measurement s ,  and  ensures  uniform  boundedness  of 
all  signals  and  (ultimately)  arbitrarily  accurate  tracking. 

3.  The  results  of  extensive  computer  simulations  and  hardware  experiments  have  been 
presented  to  demonstrate  the  performance  achievable  with  the  proposed  controllers. 
The  simulations  show  that  the  adaptive  schemes  consistently  and  significantly  outper¬ 
form  their  nonadaptive  counterparts;  additionally,  these  simulations  indicate  that  the 
proposed  regulation  scheme  provides  pdffJrcnance  superior  to  a  standard  PD  controller, 
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despite  the  fact  that  we  supply  the  PD  controller  with  perfect  (no  noise)  velocity  in¬ 
formation.  The  hardware  experiments  confirm  that  the  proposed  approach  provides 
a  simple  and  effective  means  of  obtaining  high  performance  motion  control.  For  ex¬ 
ample,  these  experiments  show  that  the  adaptive  regulation  scheme  is  more  accurate 
than  a  well-tuned  industrial  controller. 

Future  work  will  focus  on  the  application  of  the  proposed  approach  of  controller  de¬ 
velopment  to  the  problem  of  robot  compliant  motion  control,  and  will  initially  involve  an 
attempt  to  generalize  the  results  obtained  in  [20]  so  that  velocity  measurements  are  no 
longer  required.  Progress  along  these  lines  is  reported  in  [24]. 
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9.  Appendix  A:  Proof  of  Theorem  1 

In  this  Appendix,  the  proof  of  Theorem  1  is  given;  additional  details  concerning  this 
analysis  can  be  found  in  the  report  [25].  We  consider  first  the  case  in  which  no  external 
disturbances  are  acting.  Applying  the  control  law  (9)  to  the  nominal  manipulator  dynamics 
(3)  (with  d  =  0)  yields  the  closed-loop  error  dynamics 

He  +  Vcce  +  &i72w  +  k2 72e  +  $  +  G(x«j)  -  G(x)  =  0 
w  =  —  2yw  +  72e  (Al) 

f  =  -erf  +  /?(e  +  -j^-e  -  -w) 

*i7  7 

where  $  =  f  —  G(x^).  Let  U(x)  denote  the  gravitational  potential  energy  of  the  manipu¬ 
lator.  Consider  the  Lyapunov  function  candidate 

V2  =  Vi  +  +  U(x)  -  U(xd)  +  GT(xd)e  (A2) 

and  note  that  V2  is  a  positive-definite  and  radially-unbounded  function  of  e,  e,  w  and  $ 
if  7  is  chosen  sufficiently  large  (for  example,  if  7  is  chosen  large  enough  then  the  term 
k2j2eTe/2  +  U(x)  —  U(xd)  +  GT(x^)e  can  be  shown  to  be  positive-definite  in  e  using  an 
analysis  analogous  to  the  one  given  in  [3]). 

Computing  the  derivative  of  (A2)  along  (Al)  and  simplifying  as  in  the  proof  of  Lemma 
1  yields 

V,  <  -(7  -  II  e  IP  II  e  IP  -h!  II  w  f 

+  2Am„(ff)  ||  w  mi  e  ||  II  e  II II  e  IP  II  w  ||||  e  ||2 
+  ^*T(f  -  ?[e  +  -  ^D+  II  GM  -  G(x)  ||||  -  iw  || 

<  -(7  -  ^)W#)  II  e  IP  -(^  -  ||  e  |p  7  ||  w  |p  (A3) 

+  2A m„(H)  II  w  nil  e  II  II  w  mi  e  ||  +^f  ||  e  ||||  i  |p  +%  ||  w  ||||  e  ||2 

7  «i7  7 

where  M  is  a  positive  scalar  constant  satisfying  M  ||  xd  —  x  ||>||  G(x<f)  —  G(x)  ||  \txd,x 
(the  boundedness  of  the  partial  derivatives  ensures  that  such  an  M  exists),  and  where 
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and  note  that  Q2  is  positive-definite  if  7  is  chosen  large  enough.  Then  the  following  bound 
on  V2  in  (A3)  can  be  obtained  through  routine  manipulation: 


v2  <  -(A mi„(Q2)  -  jV2'/2)  ||  z  IP  (A4) 

where  0:2  is  a  scalar  constant  which  does  not  increase  as  7  is  increased.  Let  c2(t)  = 
^min{Q2)  ~  oc2V21^2(t)/^f  and  choose  7  large  enough  so  that  c2(0)  >  0  (this  is  always 
possible).  Then  (A4)  implies  that  c2(0)  <  c2(t)  Vt,  so  that  V2  <  -c2( 0)  ||  z  ||2.  Standard 
arguments  can  then  be  used  to  show  that  all  signals  are  bounded  and  that  z  (and  therefore 
e,  e,w)  converges  to  zero  asymptotically  [e.g.,  21]. 

We  consider  next  the  case  in  which  external  disturbances  are  present.  Applying  the 
adaptive  scheme  (9)  to  the  disturbed  manipulator  dynamics  (3)  (with  d  nonzero)  yields  the 
closed-loop  error  dynamics 


He  +  Vcce  +  &!72w  +  k2  -y2e  +  <&d  =  0 
w  =  -27W  +  72e  (A5) 

f  =  — <rf  +  /?(e  +  j-^-e  -  — w) 
fci7  7 

where  =  f  —  G  —  d.  Define  the  Lyapunov  function  candidate 

Vz  =  Vi  +  —  (A6) 

and  note  that  V2  is  a  positive-definite  and  radi ally-unbounded  function  of  e,  e,  w  and 
if  7  is  chosen  sufficiently  large.  Differentiating  (A6)  along  (A5)  and  simplifying  as  in  the 
proof  of  Lemma  1  yields 

v.  <  —(7  -  II  e  IP  -ffi  II  e  IP  -*lT  ||  w  IP 

+  2A „.,(#)  ||  w  ID  e  ||  ||  a  HI  e  f  ||  w  ||||  e  |P 
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Defining  z  as  in  Lemma  1  and  Qz  as 


Qz  = 


'  kh 

ki 

0 

0 


0 

(i  - 

Amax(-^0 


0 

hi 


(and  noticing  that  Qz  is  positive-definite  if  7  is  chosen  large  enough)  permits  the  following 
bound  on  V3  in  (A7)  to  be  obtained: 
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If  f max  is  selected  so  that  ||  G(x)  +  d (t)  ||<  fmax  Vx,  t  then  the  following  bound  on  V3  in 
(A8)  can  be  derived  through  routine  manipulation: 

V3  <  -(Amin(Q3)  -  ^ Vz '*)  ||  z  ||2  ||  *d  ||2  +1 

where  a3  and  rj  are  scalar  constants  which  do  not  increase  as  7,  /?  increase. 

The  remainder  of  the  proof  will  utilize  the  ultimate  boundedness  result  presented  by 
the  authors  in  [11,20],  which  is  based  on  the  work  of  Corless  and  Leitmann  [26].  Toward 
this  end,  it  is  convenient  to  bound  14  in  (A6)  in  the  following  manner: 


where  the  A;  are  positive  scalar  constants  independent  of  0.  Now  choose  two  scalar  con¬ 
stants  Vm ,  Vm  so  that  Vm  >  Vm  >  14(0),  and  define  cm  =  Amjn(Q3)  —  oczVm1^2 /ij  then 
choose  7  large  enough  so  that  cM  >  0  (this  is  always  possible).  Let  6*  =  A zv/^M  + 
2max(2/^as,  ?//cr0)  and  choose  0q  so  that  6*/0q  <  Vm  (this  is  always  possible).  Then 
selecting  0  >  0O  ensures  that  if  Vm<Vz<  VM  then  V3  <  0.  This  condition  together  with 
VM  >  Vm  >  14(0)  imphes  that  V3(t)  <  VM  Vi,  so  that  c3(t)  =  Xmin(Qz)  -  a3V31/2(t)/-y  > 
cm  >  0  Vi  and 
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where  A0  =  0  -  0O  and  it  is  assumed  that  0  is  chosen  so  that  A0  >  0.  It  follows 
immediately  that  ||  z  ||  and  ||  $d  ||  are  uniformly  bounded.  Moreover,  the  ultimate 
boundedness  result  given  in  [11,20]  is  now2directly  applicable  and  permits  the  conclusion 
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that  ||  z  ||  converges  asymptotically  to  a  closed  ball  of  radius  r2  =  8* /{ Ai(/?o  +  A/1)).  This 
completes  the  proof  of  Theorem  1. 


10.  Appendix  B:  Proof  of  Theorem  2 

In  this  Appendix,  the  proof  of  Theorem  2  is  given;  additional  details  concerning  this 
analysis  can  be  found  in  the  report  [27].  Applying  the  control  law  (10)  to  the  disturbed 
manipulator  dynamics  (3)  (with  d  nonzero)  yields  the  closed-loop  system  dynamics 

He  +  Vcce  +  fci72w  +  k2j2e  +  +  ^x,*  +  $B*d  +  Vcde  =  0 

w  =  -27W  +  72e  (£1) 


where  3>/  =  f  —  G  —  d,  $,4  =  A  —  H,  =  B  —  Vcd ,  and  the  notation  Vcd  =  I4c(x,x<f) 
is  introduced.  Note  that  in  obtaining  (Bl)  the  Coriolis/centripetal  term  was  expanded  as 
follows:  VccXd  =  Vcdx  =  Vcd'x.d  —  Vcd Consider  the  Lyapunov  function  candidate 


V4  —  Vi  +  +  -tr[— (. B2 ) 

and  note  that  V4  is  a  positive-definite  and  radially-unbounded  function  of  e,  e,  w, 

and  if  7  is  chosen  sufficiently  large.  Computing  the  derivative  of  (B2)  along  (Bl) 
and  simplifying  using  the  adaptation  laws  (11)  yields 
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where  kcd  is  an  upper  bound  on  Vcd  (i.e.,  ||  Vcd  || p-  <  kcd  Vx),  vmax  is  an  upper  bound  on 
||  Xd  ||,  and  the  m  are  positive  scalar  constants. 

Observe  that  the  following  inequalities  can  be  obtained  through  routine  manipulation: 

-?*-<*)  II  e  II2  II  */  II2  II  e  01  #/  II  +f  II  *,\\<l 

II  e  f  II  iA  \\\  +|  II  e  ||||  4^  ||F  +|  ||  ||F  < 

-f  Ami„(8)  ||  e  ||2  -£jj-  ||  iB  |||.  +g-  ||  e  HU  $B  ||F  +jg.  ||  4>s  ||F  < 


where  777,  r/g,  779  are  positive  scalar  constants  which  do  not  increase  as  the  /?,-  are  increased. 
These  inequalities  permit  the  following  upper  bound  on  V4  in  (B3)  to  be  established: 
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and  note  that  Q  is  positive-definite  if  7  is  chosen  large  enough.  If  0max/ Pmin  is  fixed  then 
there  exist  positive  scalar  constants  7710,  7711  that  do  not  increase  as  7  and  Pmin  increase, 
and  positive  scalar  constants  A,-  independent  of  7  and  0mini  such  that  V4  and  V4  in  (B2) 
and  (B4)  can  be  bounded  as 


Ai  II  *  II2  +-jh-  II  *  ||2<  Vt  <  As  II  z  II2  +-h-  II  *  II2 

Pmin  Pmin 

Vi  <  — (Amin(Q)  -  ^U1'2)  ||  Z  II2  --rh-  ||  *  ||2  +fl 

I  Pmin  Pmi't 


Now  choose  two  scalar  constants  VM,Vm  so  that  Vm  >  Vm  >  1^(0),  and  define  cm  = 
^min(Q)— VioVm1/2 /l',  then  choose  7  large  <2n$tigh  so  that  cm  >  0  (this  is  always  possible). 
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Let  8  —  max ( A3 / cm ,  A4/A5)  and  choose  ,3q  so  that  771  j S /  0o  <  V’m  (this  is  always  possible). 
Then  selecting  0min  >  /50  ensures  that  if  Vm  <  V4  <  Vm  then  V4  <  0.  This  condition 
together  with  Vm  >  Vm  >  1^(0)  imphes  that  V4(t)  <  VM  Vi,  so  that  c(f)  =  \min(Q)  ~ 
VioV41/2(t)/i  >  cm  >  0  Vt  and 


V4  <  -CM  z 


Ac 


(0  0  +  A/5) 


mi 


(0o  +  A/5) 


where  A/5  =  /5min  -  /50  and  it  is  assumed  that  0min  is  chosen  so  that  A/5  >  0.  An  analysis 
identical  to  the  one  used  in  the  proof  of  Theorem  1  now  applies  and  permits  the  conclusion 
that  ||  z  ||,||  ^  ||  are  uniformly  bounded,  which  imphes  that  e,  e,  w,  f,  A,  and  B  are 
uniformly  bounded.  Moreover,  ||  z  ||,  ||  \Er  ||  converge  exponentially  to  the  closed  balls  Bri, 
Br2 ,  respectively,  where 


n 


(  smi  \ 1/2 

\Ai(^o  +  A/5) ) 


r2  = 


1/2 


Observe  that  the  radius  of  the  ball  to  which  ||  z  ||2=||  e  ||2  +  ||  e  ||2  +  ||  w  ||2  is  guaranteed 
to  converge  can  be  decreased  as  desired  simply  by  increasing  A/5.  This  completes  the  proof 
of  Theorem  2. 
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Figure  lb:  Illustration  of  Zebra  Zero  manipulator 
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time  (seconds) 

Figure  2a:  Adaptive  (solid),  nonadaptive  (dashed),  and  PD  (dotted)  controlled  response 
of  end-effector  coordinate  y  in  PUMA  arm  simulation  study  (position  regu¬ 
lation) 


Figure  2b:  Adaptive  (solid),  nonadaptive  (dashed),  and  PD  (dotted)  controlled  response 
of  end-effector  orientation  angle  xjj  in  PUMA  arm  simulation  study  (position 
regulation)  2-92 


orientation  error  norm  (degrees) 


time  (seconds) 

Figure  3a:  Response  of  Zebra  robot  end-effector  position  error  norm  in  simulation  study 
(position  regulation) 


Figure  3b:  Response  of  Zebra  robot  end-effector  orientation  error  norm  in  simulation 
study  (position  regulation) 


time  (seconds) 


Figure  4a:  Desired  (solid),  adaptive  (dashed),  and  nonadaptive  (dotted)  response  of  end- 
effector  coordinate  y  in  PUMA  arm  simulation  study  (trajectory  tracking, 
nominal  case) 


Figure  4b:  Desired  (solid),  adaptive  (dashed),  and  nonadaptive  (dotted)  response  of  end- 
effector  coordinate  z  in  PUMA  arm  simulation  study  (trajectory  tracking, 
nominal  case)  2-94 


Figure  4c:  Desired  (solid),  adaptive  (dashed),  and  nonadaptive  (dotted)  response  of 
end-effector  orientation  angle  ip  in  PUMA  arm  simulation  study  (trajectory 
tracking,  nominal  case) 


Figure  5a:  Desired  (solid)  and  adaptive  (dashed)  response  of  end-effector  coordinate  y 
in  PUMA  arm  simulation  study  (trajectory  tracking,  robust  case) 
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in  experimental  study  (position  regulation,  no  payload) 
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Figure  7a:  Adaptive  (solid)  and  IMI  (dashed)  controlled  response  of  Zebra  waist  angle 
in  experimental  study  (position  regulation,  payload) 
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angle  in  experimental  study  (position  regulation,  payload) 


mental  study  (trajectory  tracking) 
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Figure  8b:  Desired  (solid)  and  actual  (dashed)  response  of  Zebra  shoulder  angle  in  ex¬ 
perimental  study  (trajectory  tracking) 
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Abstract 

This  article  presents  two  adaptive  schemes  for  compliant  mo¬ 
tion  control  of  dexterous  manipulators.  The  first  scheme  is 
developed  using  an  adaptive  impedance  control  approach  for 
torque-controlled  manipulators .  whereas  the  second  strategy 
is  an  adaptive  admittance  controller  for  position-controlled 
manipulators.  The  proposed  controllers  are  very  general  and 
computationally  efficient,  as  they  do  not  require  knowledge  of 
the  manipulator  dynamic  model  or  parameter  values  of  the 
manipulator  or  the  environment  and  are  implemented  without 
calculation  of  the  inverse  dynamics  or  inverse  kinematic  trans¬ 
formation.  It  is  shown  that  the  control  strategies  are  globally 
stable  in  the  presence  of  bounded  disturbances  and  that  in  the 
absence  of  disturbances  the  ultimate  bound  on  the  size  of  the 
system  errors  can  be  made  arbitrarily  small.  The  capabilities 
of  the  proposed  control  schemes  are  illustrated  through  both 
computer  simulations  and  laboratory  experiments  with  a  dex¬ 
terous  Robotics  Research  Corporation  seven-degrees-of-freedom 
(DOF)  manipulator. 

1.  Introduction 

Manipulators  are  subject  to  interaction  forces  whenever 
they  perform  tasks  involving  motion  that  is  constrained 
by  the  environment,  such  as  assembling  parts  or  exe¬ 
cuting  grinding  or  finishing  operations.  In  such  motions 
the  interaction  forces  must  be  accommodated  rather  than 
resisted,  so  as  to  comply  with  the  environmental  con¬ 
straints.  Recognizing  the  importance  of  compliant  motion 
for  robots,  many  researchers  have  investigated  this  prob¬ 
lem  in  recent  years.  This  intensive  study  has  produced 
two  basic  approaches  to  achieving  compliant  motion: 

The  International  Journal  of  Robotics  Research. 

VoL  14.  No.  3.  June  1995.  pp.  270-280,  2  -  10 

£)  1995  Massachusetts  Institute  of  Technology.  ~ 


force  control(Raibert  and  Craig  1981)  and  impedance 
control  (Hogan  1985).  The  first  approach  is  based  on 
the  observation  that  when  the  manipulator  end-effector 
is  in  contact  with  the  environment,  the  Cartesian  space 
of  the  end-effector  coordinates  can  be  naturally  decom¬ 
posed  into  a  “position  subspace”  and  a  “force  subspace”; 
these  two  subspaces  correspond  to  the  Cartesian  direc¬ 
tions  in  which  the  end-effector  is,  respectively,  free  to 
move  and  constrained  by  the  environment.  The  force  con¬ 
trol  approach  to  compliant  motion  is  then  to  define  and 
track  a  position  (and  orientation)  trajectory  in  the  posi¬ 
tion  subspace  and  a  force  (and  moment)  trajectory  in  the 
force  subspace.  Alternatively,  the  impedance  control  ap¬ 
proach  proposes  that  the  control  objective  should  be  the 
regulation  of  the  mechanical  impedance  of  the  robot  end- 
effector.  Thus,  the  goal  of  the  impedance  controller  is  to 
maintain  a  desired  dynamic  relationship  between  the  end- 
effector  position  and  the  end-effector/environment  contact 
force.  It  is  noted  that  this  distinction  between  force  con¬ 
trol  and  impedance  control  can  be  viewed  as  primarily 
a  philosophical  one  and  that  in  implementation  the  two 
approaches  are  closely  related  (Volpe  and  Khosla  1993). 

During  the  past  decade  there  has  been  considerable 
interest  in  developing  adaptive  control  strategies  for 
robot  manipulators.  This  attention  has  resulted  in  the 
design  of  several  high-performance  unconstrained  rr» 
tion  controllers,  and  the  success  of  these  schemes  has 
stimulated  interest  in  applying  adaptive  control  theory 
to  the  problem  of  robot  compliant  motion  control  ie.g., 
Niemeyer  and  Slotine  1991;  Lu  and  Meng  1991;  Ca.rdli 
and  Kelly  1991;  Lozano  and  Brogliato  1992;  Colbaugh 
et  al.  1993b;  Jean  and  Fu  1993).  Virtually  all  of  the  adap¬ 
tive  compliant  motion  controllers  proposed  to  date  have 
been  developed  by  assuming  that  the  structure  of  are  ma¬ 
nipulator  dynamics  is  known  a  priori  and  that  onh  are 
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constant  inertial  parameters,  which  appear  linearly  in  the 
model,  are  unknown.  We  will  call  these  schemes  model- 
based  adaptive  controllers  because  of  their  reliance  on 
information  concerning  the  manipulator  dynamic  model. 
Although  the  model-based  approach  has  yielded  adaptive 
schemes  for  which  global  stability  can  be  demonstrated, 
there  are  potential  difficulties  associated  with  this  method 
of  controller  development.  For  instance,  the  design  and 
implementation  of  these  controllers  requires  precise 
knowledge  of  the  structure  of  the  entire  robot  dynamic 
model,  which  is  often  unrealistic  in  practice,  and  the  re¬ 
sulting  control  law  can  be  computationally  intensive. 
Recent  studies  have  indicated  that  the  transient  behavior 
of  unconstrained  motion  control  schemes  designed  using 
the  model-based  adaptive  approach  can  be  undesirable 
and  that  they  can  lack  robustness  to  unmodeled  dynam¬ 
ics,  sensor  noise,  and  other  disturbances  (e.g.,  Reed  and 
loannou  1989);  it  is  expected  that  these  problems  may  be 
compounded  in  the  case  of  constrained  motion  because  of 
the  interaction  with  the  environment. 

It  should  also  be  mentioned  that  in  the  design  of  the 
model-based  adaptive  compliant  motion  control  strate¬ 
gies  cited  earlier,  it  is  assumed  that  the  manipulator  joint 
actuators  can  be  accessed  directly  so  that  joint  torque  is 
the  control  input.  Although  this  control  problem  is  cer¬ 
tainly  important  and  is  addressed  in  this  article  as  well, 
it  is  useful  to  recognize  that  typically  the  manipulator  is 
equipped  with  a  position  control  system.  In  such  cases  it 
is  of  interest  to  consider  the  problem  of  specifying  po¬ 
sition  set  points  for  this  “inner-loop”  controller  so  that 
the  desired  compliant  behavior  is  realized.  This  approach 
has  the  advantage  of  being  readily  implementable  with 
industrial  robots  and  as  a  consequence  is  often  adopted 
in  practice.  However,  there  has  been  relatively  little  work 
in  which  the  stability  of  the  complete  system  is  demon¬ 
strated  or  in  which  adaptation  is  introduced  in  the  “outer 
loop”  to  provide  improved  performance. 

This  article  presents  two  new  adaptive  compliant  mo¬ 
tion  control  schemes  designed  to  overcome  the  difficulties 
associated  with  previous  strategies.  The  control  laws 
are  derived  directly  in  task  space  and  do  not  require 
knowledge  of  the  manipulator  dynamic  model  or  para¬ 
meter  values  for  the  manipulator  or  the  environment.  As  a 
consequence,  the  schemes  are  very  general  and  computa¬ 
tionally  efficient  and  are  implementable  with  both  nonre- 
dundant  and  redundant  manipulators.  The  first  scheme  is 
an  impedance  controller  that  adaptively  generates  the  joint 
torques  required  to  provide  the  desired  dynamic  relation¬ 
ship  between  end-effector  position  and  contact  force.  The 
second  compliant  motion  control  strategy  is  an  adaptive 
admittance  controller  for  position-controlled  manipula¬ 
tors.  In  this  force  control  scheme,  position  set  points  for 
the  inner-loop  position  controller  are  adaptively  speci¬ 
fied  to  ensure  that  the  desired  end-effector/environment 


contact  force  is  achieved.  It  is  shown  that  both  control 
strategies  are  globally  stable  in  the  presence  of  bounded 
disturbances  and  that  in  the  absence  of  disturbances  the 
ultimate  bound  on  the  size  of  the  controller  errors  can  be 
made  arbitrarily  small.  The  effectiveness  of  the  proposed 
schemes  is  illustrated  through  both  computer  simulations 
and  laboratory  experiments  with  a  Robotics  Research 
Corporation  Model  K-1207  redundant  manipulator. 

2 .  Preliminaries 

Let  y  define  the  position  and  orientation  of  the  robot  end 
effector  relative  to  a  fixed  user-defined  reference  frame, 
and  note  that,  in  the  most  general  case,  the  elements  of 
y  are  local  coordinates  for  some  smooth  manifold  M 
of  dimension  m.  The  forward  kinematic  and  differential 
kinematic  maps  between  the  robot  joint  coordinates  6  € 
N  and  the  end-effector  coordinates  y  can  be  written  as 

y  =  h(0),  y  z=  J (6)0  (1  j 

where  N  is  a  smooth  manifold  of  dimension  n,  h  :  X  — 
M,  and  J  €  3Rmxn  is  the  end-effector  Jacobian  matrix. 

Because  the  compliant  motion  control  objectives  are 
stated  most  naturally  in  terms  of  task  space  variables, 
we  will  study  the  compliant  control  problem  directly  in 
task  space.  If  the  manipulator  is  nonredundant  (so  that 
m  =  n)  and  is  in  a  region  of  the  workspace  where  J  has 
full  rank,  then  the  elements  of  y  are  generalized  coordi¬ 
nates  for  the  manipulator,  and  this  formulation  presents 
no  difficulties.  A  task  space  formulation  can  also  be  real¬ 
ized  if  the  manipulator  is  kinematically  redundant  (when 
m  <  n)  by  utilizing,  for  example,  the  configuration 
control  approach  (Seraji  1989).  In  what  follows,  we  will 
consider  nonredundant  and  redundant  robots  together 
and  formulate  the  compliant  motion  control  problem  in 
terms  of  a  set  of  n  generalized  coordinates  x  obtained 
by  augmenting  y  with  n  —  m  kinematic  functions  that 
define  some  auxiliary  task  to  be  performed  by  the  ma¬ 
nipulator  (Seraji  1989).  To  retain  generality,  we  shall 
require  only  that  the  elements  of  x  are  local  coordinates 
for  some  smooth  manifold  DC  of  dimension  n  and  that 
the  kinematic  relationship  between  0  and  x  is  known  and 
continuous  and  can  be  written  in  a  form  analogous  to  ( I ): 

x  =  ha(0),  x  =  Ja(6)6  (2) 

where  ha  :  N  — +  X  and  JQ  €  $Rnxn.  Observe  that  for  x 
to  be  a  valid  generalized  coordinate  vector,  the  elements 
of  x  must  be  independent  in  the  region  of  interest;  thus, 
it  will  be  assumed  in  our  development  that  Ja  is  of  full 
rank.  Note  also  that  if  X  is  compact  (as  is  the  case  w  ith 
a  manipulator  possessing  revolute  and/or  finite-travel 
prismatic  joints)  then  the  continuity  of  hfl  implies  thax  DC 
is  compact.  In  what  follows  it  will  be  assumed  that  X  is 
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compact,  and  this  fact  will  prove  useful  in  the  controller 
development. 

Consider  the  manipulator  dynamic  model  written  in 
terms  of  the  generalized  coordinates  x: 

F  =  H(x)x  4  Vcc(x,  x)x  4  G(x)  4  P  4  d(x,  x,  t)y  (3) 

where  F  E  S?n  is  the  generalized  force  associated  with  x, 
H  E  ftnXn  is  the  manipulator  inertia  matrix,  Vcc  €  2ftuxn 
quantifies  Coriolis  and  centripetal  acceleration  effects, 

G  €  is  the  vector  of  gravity  forces,  and  P  € 
is  the  vector  of  forces  and  moments  exerted  by  the  end 
effector  on  the  environment.  The  term  d  6  9?n  is  a  vector 
of  bounded  but  otherwise  arbitrary  disturbances  that  can 
represent  unmodeled  state-dependent  effects  or  time- 
dependent  disturbances.  The  properties  of  the  system 
(3)  that  are  utilized  in  the  controller  development  are 
summarized  in  the  following  lemma. 

Lemma  1.  For  any  set  of  generalized  coordinates  x, 
the  dynamic  model  terms  Hy  G  are  bounded  functions 
of  x  whose  time  derivatives  Hy  G  are  also  bounded  in 
x  and  depend  linearly  on  x;  the  matrix  H  is  symmetric 
and  positive  definite;  and  the  matrices  H  and  Vcc  are 
related  by  H  —  Vcc  4  V£.  In  addition,  the  generalized 
force  F  and  the  physically  realizable  joint-space  control 
torque  T  E  5Rn  satisfy  the  equation  T  =  JaT F.  Finally, 
V^x.  x)u  =  Vrcc(x,u)x  Vu,  and  if  u  is  bounded  and  u 
grows  at  most  linearly  with  x,  then  V^x,  u)  is  bounded 
and  V^(x,u)  grows  linearly  with  x. 

Proof.  All  of  the  properties  except  the  last  one  are  well 
known  if  joint  angles  are  used  as  generalized  coordinates 
and  are  proved  for  arbitrary  generalized  coordinates  in 
Stepanenko  and  Yuan  (1992).  The  last  property  of  the 
Coriolis/centripetal  matrix  Vcc  is  easy  to  see  by  writing 
Vcc  in  terms  of  Christoffel  symbols.  □ 

Observe  that  in  view  of  the  relationship  T  =  JaT F, 
the  assumption  that  Ja  is  full  rank  is  equivalent  to  the 
assumption  that  F  is  always  realizable  as  a  control  input 
through  the  proper  specification  of  T.  Therefore,  our 
formulation  of  the  manipulator  compliant  motion  control 
problem  directly  in  task  space  is  justified,  and  the  focus 
of  the  subsequent  discussion  is  on  the  adaptive  control  of 
the  system  (3).  Our  approach  to  the  design  and  analysis 
of  suitable  controllers  is  based  on  Lyapunov  stability 
theory.  The  following  lemma  summarizes  the  properties 
of  a  certain  class  of  Lyapunov  functions  and  will  be  of 
direct  relevance  in  our  development. 

Lemma  2.  Consider  the  coupled  dynamic  system 

=  fi(xj,x2,0.  x2  =  f2(xi , x2, 0*  Let  Vr(X},x2,f)  be 
a  Lyapunov  function  for  the  system  with  the  properties 

A,  ||  x,  ||2  +A2  ||  x2  |j2<  V  <  A3  ||  x,  ||2  +A4  ||  x2  ||2 
V  <  “As  II  x,  ||2  -A6  ||  x2  ||2  +£.  „ 


where  c  and  the  A^  are  positive  scalar  constants.  De¬ 
fine  <5  =  max(A3/A5,  A4/A6)  and  r*  =  (6e/\ty  2  for 
i  =  1,2.  Then  for  any  initial  state  xi(0),x2(0)  the  sys¬ 
tem  will  evolve  so  that  xi(t),x2(f)  are  uniformly  bounded 
and  converge  exponentially  to  the  closed  balls  B-.%  Br„ 
respectively,  where  BVi  =  {x*  :  ||  X;  ||<  r,*}  (see  Cor- 
less  [1990]  for  a  discussion  of  exponential  convergence  to 
a  closed  ball). 

Proof .  Straightforward  application  of  the  global  expo¬ 
nential  convergence  theorem  of  (Corless  1990)  gives  the 
result.  □ 

3.  Adaptive  Impedance  Controller 

In  this  section  the  proposed  adaptive  impedance  controller 
for  torque-controlled  manipulators  is  developed.  When 
the  end  effector  of  the  manipulator  is  in  contact  with 
the  environment,  the  objective  of  impedance  control  is 
to  cause  the  end  effector  to  respond  to  contact  forces 
according  to  some  user-defined  dynamics.  For  example, 
the  desired  dynamic  relationship  between  the  end-effector 
position  x,  the  end-effector  reference  trajectory  x.  E  X, 
and  the  end-effector/environment  contact  force  P  is  often 
specified  as  follows: 

M(xr  -  x)  +  C(xr  -  x)  +  K(xr  -  x)  =  P.  (4) 

where  xr  is  bounded  and  twice  differentiable,  and  the  ma¬ 
trices  MyCyK  E  Snxn  are  specified  by  the  user  so  that 
the  dynamic  system  (4)  possesses  the  desired  characteris¬ 
tics.  The  proposed  impedance  control  system  consists  of 
two  subsystems:  a  simple  filter  that  characterizes  the  de¬ 
sired  performance  (4),  and  an  adaptive  task  space  control 
scheme  that  generates  the  control  input  F  to  the  manipu¬ 
lator  dynamics  (3)  that  ensures  the  desired  performance  is 
realized.  Recall  that  it  is  assumed  that  Ja  is  full  rank,  and 
this  assumption  together  with  the  relationship  T  =  .JaT F 
ensures  that  this  strategy  is  implementable. 

3.7.  Control  Scheme 

Observe  that  the  impedance  control  objective  can  be 
realized  by  causing  the  end-effector  position  x  to  track 
closely  the  desired  impedance  trajectory  x<*  E  X.  defined 
as  the  solution  to  the  differential  equation: 

Mxd  +  C\d  4-  Kxd  =  -P™  4  Mxr  4  CxT  -  A'xr 

x<f(0)  =  xr(0),  x<*(0)  =  xr(0)  (5) 

obtained  from  the  impedance  model  (4),  where  P-.  is 
the  force/torque  sensor  measurement  of  the  contact  force 
P.  Eq.  (5)  may  be  interpreted  as  defining  a  simple  niter 
that,  given  definitions  for  xr(f ),  M ,  C,  and  K  and  on¬ 
line  measurements  Pm,  characterizes  the  desired  dynamic 
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relationship  between  end-effector  position  and  contact 
force  through  the  specification  of  x<*.  The  function  of 
the  task  space  impedance  controller  is  to  ensure  that  the 
manipulator  dynamics  (3)  evolves  in  such  a  way  that  the 
end-effector  position  x  closely  tracks  Xd  defined  in  (5). 

Let  x*d  =  xd  +  Ae  and  q~e  +  \e  =  xd-x  denote  the 
“modified  velocity”  and  weighted  position-velocity  error, 
respectively,  where  e  =  xd  —  x  is  the  trajectory  tracking 
error  and  A  is  a  positive  scalar  constant.  Consider  the 
following  adaptive  controller: 

F  =  A(t)rd  +  B(t)x$  +  f(t)  +  Pm  +  [2k  +  K(t)  Jq  (6) 

where  k  is  a  positive  scalar  constant  and  f(i)  €  3?n, 

A(t)  €  Rnxn,  B{t)  €  3?nxn,  K(t)  €  3?nxn  are  adaptive 
gains  whose  update  laws  are  defined  to  be 

f  =  -a,f  +  /?,q 

A  =  -a2A  +  /J2q(Xrf)T  (7) 

B  =  -a3B  +  fcq(x'df 
I<  =  -a4/<r  +  /34qqT 

In  (7),  the  are  positive  scalar  constants  and  the  a*  are 
functions  of  the  form  a*  =  a*0  +  an  ||  x  ||,  where  a^an 
are  positive  scalar  constants.  Applying  the  control  law  (6) 
to  the  manipulator  dynamics  (3)  yields  the  tracking  error 
dynamics 

H q+Vccq+2kq-t-<f> j  +$> AXd-\-$> BXd+$> f<q— d*  =  0,  (8) 

where  =  f  —  G,  —  A  -  H,  =  B  ~ 

=  K  +  Kcc(x,x*),  and  d*  =  d  +  P  -  Pm.  Note 
that  in  obtaining  (8)  the  Coriolis/centripetal  term  was 
expanded  as  follows:  Vcc(x,x)xd  =  I4c(x,x^)x  = 

Vcc(x,  x*)x*  —  Vcc(x,x^)q.  Observe  that  e,e  and  e,q 
are  diffeomorphic,  so  that  either  pair  could  be  combined 
with  to  provide  a  valid  representation 

for  the  state  of  the  closed-loop  dynamics  (7),  (8).  Indeed, 
conclusions  concerning  the  evolution  of  the  system  in 
terms  of  the  two  sets  of  coordinates  are  trivial  to  relate, 
because  the  representations  have  coincident  origins  and 
are  linearly  related.  In  what  follows,  the  set  of  coordi¬ 
nates  containing  the  pair  e,  q  will  be  used  as  the  state 
variables  for  the  system. 

The  stability  properties  of  the  proposed  adaptive  con¬ 
troller  (6),  (7)  are  summarized  in  the  following  theorem. 

Theorem  1 .  The  adaptive  controller  (6),  (7)  ensures 
that  e,q,  arc  globally  uniformly  bounded 

for  all  bounded  disturbances  d.  Moreover,  if  there  are  no 
external  disturbances  or  force  measurement  errors  (so  that 
d  =  0  and  Pm  =  P),  then  the  state  error  e,  q  is  guaranteed 
to  converge  exponentially  to  a  compact  set  that  can  be 
made  arbitrarily  small. 


Proof.  The  proof  is  given  in  Appendix  A. 

Several  observations  can  be  made  concerning  the  adap¬ 
tive  control  strategy  (6),  (7).  First  note  that  the  proposed 
control  law  is  very  simple  and  requires  virtually  no  infor¬ 
mation  concerning  the  robot  dynamics,  and  therefore,  it 
provides  a  computationally  efficient,  modular,  and  read¬ 
ily  implementable  approach  to  manipulator  impedance 
control.  Next,  consider  the  situation  where  there  are  no 
external  disturbances  or  force  measurement  errors,  so  that 
d*  =  0.  Theorem  1  states  that  in  this  case  the  tracking 
errors  converge  exponentially  to  a  compact  set  that  can 
be  made  arbitrarily  small  by  increasing  the  adaptation 
gains  Pi.  Note  that  exponential  convergence  ensures  that 
the  transient  behavior  of  the  errors  will  be  well  behaved. 
In  general,  when  disturbances  and  force  measurement 
errors  are  present  (so  that  d*  is  nonzero),  the  size  of  the 
controller  errors  is  influenced  by  the  magnitude  of  d*; 
however,  increasing  the  controller  gains  can  reduce  this 
effect  and  exponential  convergence  of  the  tracking  errors 
to  a  compact  set  is  still  guaranteed. 

3.2 .  Simulation  Results 

The  adaptive  impedance  control  scheme  described  in  Sec¬ 
tion  3.1  is  now  applied  to  a  large  industrial  robot  through 
computer  simulation.  The  robot  chosen  for  this  simulation 
study  is  the  dexterous  7-DOF  Robotics  Research  Corpo¬ 
ration  (RRC)  Model  K-1207  arm  depicted  in  Figure  I. 
The  simulation  environment  incorporates  models  of  all 


Fig.  1 .  Illustration  of  RRC  Model  K-1207  robot  in  contact 
with  a  surface. 
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important  dynamic  subsystems  and  phenomena,  such  as 
full  nonlinear  arm  dynamics,  joint  stiction,  sensor  noise, 
and  transmission  effects,  and  therefore  provides  the  basis 
for  a  realistic  evaluation  of  controller  performance.  The 
control  law  is  applied  to  the  manipulator  model  with  a 
sampling  period  of  2  ms.  and  all  integrations  required  by 
the  controller  are  implemented  using  a  simple  trapezoidal 
integration  rule  with  a  time  step  of  2  ms. 

This  simulation  study  illustrates  how  the  proposed 
impedance  control  scheme  can  accommodate  the  transi¬ 
tion  between  unconstrained  and  constrained  end-effector 
motion  and  demonstrates  that  proper  utilization  of  robot 
redundancy  can  improve  the  performance  of  this  transi¬ 
tion.  In  this  study  a  frictionless  reaction  surface  with  a 
stiffness  of  lOOIb/in  is  placed  in  the  robot  workspace 
so  that  it  is  oriented  normal  to  the  y  axis  and  is  lo¬ 
cated  at  y  =  0.0.  The  initial  robot  configuration  is 
defined  to  be  0(0)  =  [0°,0°,0°,  -90°, 0°,  -90°.0°]r, 
and  the  manipulator  is  initially  at  rest.  The  refer¬ 
ence  trajectory  for  end-effector  translation  is  defined 
as  xr(t)  =  19.675  +  10(1  -  cos  7rf/3),t/r(£)  = 

-26.0+ 13.5(1  -cos  27Tf/3),zr(f)  =  0.0  for  t  €  [0. 1.5]  and 
xr(t)  =  19.675+  10(1  -  cos7Tt/3Ujr(t)  =  1.0,  zr(t)  =  0.0 
for  t  6  [1.5. 3.0].  In  addition,  the  manipulator  is  required 
to  maintain  its  initial  end-effector  orientation.  Note  that 
the  reference  trajectoiy  intersects  the  reaction  surface  at 
approximately  t  =  1.4.  so  that  at  this  point  the  robot 
undergoes  an  abrupt  transition  from  unconstrained  to  con¬ 
strained  motion. 

The  RRC  manipulator  has  an  n  =  7-dimensional  joint 
space  and  an  m  =  6-dimensional  task  space;  thus,  this 
robot  is  redundant  for  the  task  of  end-effector  positioning 
and  possesses  one  degree  of  redundancy.  In  this  simula¬ 
tion  study,  the  available  redundancy  is  utilized  to  reduce 
the  impact  force  by  maximizing  the  objective  function 
GO )  =  hTH~lh  at  the  point  of  impact,  where  n  is  the 
unit  normal  to  the  surface  (Walker  1990).  The  adaptive 
impedance  controller  proposed  in  Section  3.1  is  used  in 
this  study.  The  impedance  parameters  in  (5)  are  defined 
as  M  =  diag[  1  ],  K  =  diag[10.5. 10, 10, 10. 10),  and 
C  =  diag [2y/mnkii].  The  adaptive  gains  f.  A,  B,  K 
are  set  to  zero  initially,  and  the  remaining  controller 
parameters  are  set  as  follows:  k  =  20,  A  =  2, 

<*i0  =  on  =  0.001,  A  =  10  for?  =  1,2 . 4.  The 

control  input  F  is  mapped  to  a  vector  of  joint  torques  T 
using  the  relation  T  =  JaT F.  The  results  of  this  sim¬ 
ulation  are  given  in  Figure  2A-2C.  Figure  2 A  indicates 
that  the  end-effector  closely  tracks  the  reference  trajectory 
in  the  y  direction  until  contact  with  the  reaction  surface 
is  made.  At  contact,  the  robot  smoothly  ceases  to  track 
the  y  component  of  the  reference  trajectory  in  favor  of 
complying  with  the  reaction  surface:  note  from  Figure  2 B 
that  this  behavior  has  very'  little  effect  on  the  robot  mo¬ 
tion  in  the  a-  direction.  It  can  be  seen  from  Fisure  2 C 
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that  the  impact  force  incurred  during  the  transition  from 
unconstrained  to  constrained  motion  is  acceptable. 

4.  Adaptive  Admittance  Controller 

We  now  turn  to  the  development  of  the  proposed  adaptive 
admittance  controller  for  position-controlled  manipula¬ 
tors.  This  approach  is  motivated  by  the  fact  that  industrial 
robots  are  always  equipped  with  a  position  control  sys¬ 
tem  and  addresses  the  problem  of  adaptively  specifying 
position  setpoints  for  this  inner-loop  controller  so  that 
the  desired  compliant  behavior  is  realized.  Thus,  in  this 
method  of  compliant  motion,  the  manipulator  and  position 
controller  are  viewed  together  as  a  single  system  to  be 
controlled  using  position  commands  as  the  control  inputs. 
This  approach  has  the  advantage  of  being  readily  imple- 
mentable  with  industrial  robots.  In  addition,  because  this 
is  a  force  control  methodology,  it  is  appropriate  for  those 
applications  that  require  accurate  regulation  of  the  end- 
effector/environment  contact  force  despite  uncertainty 
regarding  the  location  and  stiffness  of  the  environment. 

In  what  follows,  the  inner-loop  position  controller  is 
assumed  to  have  a  proportional-derivative  (PD)  feedback 
structure,  as  this  is  a  simple  and  well-known  strateev  and 
is  representative  of  existing  industrial  robot  controllers: 
however,  the  analysis  techniques  utilized  here  are  appli¬ 
cable  to  other  inner-loop  controllers  as  well.  Let  xd  £  X 
define  the  desired  task  space  trajectory  and  e  =  xd  -  x  de¬ 
note  the  trajectory  tracking  error.  Consider  the  PD  control 
law 

F=A'ie-A'2x,  (9) 

where  K\,I<2  €  5?nx"  are  constant  feedback  matrices. 
Applying  the  control  law  (9)  to  the  manipulator  dynam¬ 
ics  (3)  yields 

Hx  +  Vccx  +  fc2x  +  A-)  x  +  G  -I-  d  +  P  =  &,  x<f .  (10) 

where,  for  simplicity  of  development,  the  feedback  matri¬ 
ces  are  assumed  to  have  the  form  K\  =  diagffci],  A:  = 
diag[&2],  with  k\ ,  k2  positive  scalar  constants.  The  dif¬ 
ferential  equation  (10)  quantifies  the  dynamics  of  the  PD 
controlled  manipulator. 

4.1.  Control  Scheme 

Consider  the  position-controlled  manipulator  (10)  interact¬ 
ing  with  a  linearly  elastic  environment.  Let  xc  represent 
the  p  generalized  coordinates  to  be  under  compliant  mo¬ 
tion  control  and  xp  be  the  ( n—p )  position-controlled  coor¬ 
dinates.  Assume  that  x  is  ordered  so  that  x  =  [xj  xrjr, 
and  note  that  this  decomposition  can  always  be  achieved 
in  the  constraint  frame  by  using  a  properly  defined  coor¬ 
dinate  transformation  (e.g..  Lozano  and  Brogliato  1 992 1. 
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Fig.  2.  A,  Response  of  end-effector  coordinate  y  in  impedance  control  simulation  example.  B,  Response  of  end-effector 
coordinate  x  in  impedance  control  simulation  example.  C,  End-effector / environment  contact  force  P  in  impedance 
control  simulation  example. 
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The  manipulaior/environment  interaction  force  is  then 
gi’ven  by  P  =  [0T  Pj~]  ,  where  Pc  £  5?p  has  the  form 

-^-env(\:  ^env)»  (11) 

where  K ^  €  J?pxp  and  x*nv  €  3?p  are  the  environmental 
stiffness  and  location,  respectively,  and  Kcnv  =  diag  [A:env] 
is  assumed  for  simplicity  of  presentation. 

Let  Xpd  and  Pd  represent  the  desired  constant  end- 
effector  position  and  contact  force  for  the  manipulator, 
respectively.  The  adaptive  admittance  control  problem 
involves  designing  a  strategy  for  specifying  xd(t)  so  that 
Xp,  Pc  approach  xpdy  Pd  despite  considerable  uncertainty 
regarding  the  manipulator,  the  inner-loop  position  con¬ 
troller,  and  the  environment.  Implicit  in  this  problem 
definition  is  the  objective  of  utilizing  the  admittance  con¬ 
troller  not  only  to  provide  compliance  control  (in  the 
force  subspace),  but  also  to  improve  the  accuracy  of  the 
inner-loop  position  controller  (in  the  position  subspace). 
Consider  the  control  law 


tp(t)  +  Kp(t)(xpd  -  xp) 
fc(£)  +  Ac(0ec 


where  fp  e  fc  €  3 ?p,  Kp  £  5ft<n-p>x(n-p>^ 

Kc  £  3?pxp  are  adaptive  gains  and  ec  —  Pd  -  Pc.  Ap¬ 
plying  the  control  strategy  (12)  to  the  position-controlled 
manipulator  dynamics  (10)  yields  the  error  dynamics 


H'tf  4-  Vccef  —  kitj  4-  k\tf  4- 


(13) 


A*env  d  —  0, 


where  tf  =  [A^v(Xpd  -  Xp)T  j  (?d  -  Pc)rf,  <j>p  = 
^l^envfp  -  A^Gp,  <f>c  =  k{kcuJc  -  AjfcenvXenv  -  (ATj  + 
^cnv)P</  -  A’tnvGc  (j>kp  —  k\Kp.  okc  =  k\kenvKc  4-  kcnvIy 
I  is  the  p  x  p  identity  matrix,  and  the  gravity  vector  is 
partitioned  as  G  =  [Gp  Gj ]T. 

Observe  that  e7  =  ef  =  0,  <£>p  =  <j>c  =  0, 

<PkP  =  <t>kc  =  0  is  an  equilibrium  point  for  (13)  in  the 
absence  of  external  disturbances  (so  that  d  =  0),  and  this 
equilibrium  point  corresponds  to  xp  =  xpdy  Pc  =  Pd  as 
desired.  Thus,  the  control  objective  is  to  choose  update 
laws  for  f  =  [f^  fcr]r  and  AT  =  [Kj  Kj]T  that  ensure 
that  this  equilibrium  is  globally  stable  and  that  a  (small) 
neighborhood  of  this  equilibrium  is  globally  attractive. 
Toward  this  end.  define  the  following  adaptation  laws  for 
f  and  K : 


f~-Qjf~3iq,  K  =  -Q2Ar  -b  faqej  (14) 

where  q  =  e/  -  ee/,  €  is  a  positive  constant,  the  fa  are 
positive  scalar  constants,  and  the  a*  are  functions  of  the 
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form  a,-  —  c*to  4-  o^i  ||  x  ||  with  positive  scalar 

constants. 

The  following  theorem  indicates  that  the  proposed 
adaptive  admittance  controller  (12),  (14)  realizes  the 
control  objective  of  accurate  position  regulation  in  the  po¬ 
sition  subspace  and  force  regulation  in  the  force  subspace. 

Theorem  2.  Suppose  that  the  controller  parameter  e  is 
chosen  so  that 


e  <  min 


~(Mmin(if))l/2  _ h _ 

Amax(tf)  k\/4kx  +  Amax(#)  -  emzxkcc 


where  kcc  satisfies  ||  Vcc  ||F<  kcc  ||  ef  ||  (this  bound  ex¬ 
ists  as  a  direct  consequence  of  Lemma  1)  and  emax  is  an 
upper  bound  on  the  task  space  position  error.  (This  bound 
exists  because  !Nf  is  compact  and  hQ  is  continuous;  see 
also  Lozano  and  Brogliato  [1992],  Wen  et  al.  [1992],  and 
Colbaugh  et  al.  [1994].)  Then  the  adaptive  controller  (12), 
(14)  ensures  that  ej,  e/,  f>p,  <fc,  <pkp,  (fkc  are  globally 
uniformly  bounded  for  all  bounded  disturbances  d.  More¬ 


over,  if  there  are  no  external  disturbances  (so  that  d  =  0), 
then  tj.tf  are  guaranteed  to  converge  exponentially  to  a 
compact  set  that  can  be  made  arbitrarily  small. 


Proof .  The  proof  is  given  in  Appendix  B.  □ 


Several  observations  can  be  made  concerning  the  adaptive 
admittance  control  strategy  (12),  (14).  First,  note  that  the 
proposed  scheme  is  extremely  simple  and  requires  very 
little  information  concerning  the  manipulator  model,  the 
inner-loop  position  controller,  or  the  environment.  For 
example,  the  controller  parameter  selection  process  re¬ 
quires  only  very  conservative  estimates  for  manipulator 
model  terms.  The  adaptive  strategy  (12),  (14)  ensures 
that  in  the  absence  of  external  disturbances,  the  controller 
errors  converge  exponentially  to  a  compact  set  that  can  be 
made  arbitrarily  small  (by  increasing  the  adaptation  gains 
0i).  If  disturbances  are  present  (so  that  d  is  nonzero),  the 
size  of  the  controller  errors  is  influenced  by  the  magni¬ 
tude  of  d;  however,  increasing  the  controller  gains  can 
reduce  this  effect.  Therefore,  the  proposed  controller  pro¬ 
vides  a  computationally  efficient  and  modular  solution  to 
the  manipulator  compliant  motion  control  problem.  The 
fact  that  the  control  scheme  is  designed  to  be  used  with 
position-controlled  manipulators  makes  implementation 
with  industrial  robots  particularly  convenient.  Note  that 
the  weighted  error  q  used  in  the  adaptive  laws  1 14)  con¬ 
tains  ec.  suggesting  that  implementation  of  the  proposed 
controllers  requires  Pc,  which  is  ordinarily  not  available. 

It  is  easy  to  show  by  direct  integration  that,  although  f 
and  K  depend  on  ec,  f  and  K  do  not,  so  that  the  con¬ 
troller  does  not  require  force  derivative  information  for 
implementation. 
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4.2.  Experimental  Results 

The  adaptive  admittance  controller  (12),  (14)  is  now  ap¬ 
plied  to  an  industrial  robot  in  a  series  of  experiments.  The 
experimental  facility  utilized  for  this  study  was  the  Jet 
Propulsion  Laboratory  Supervisory  Telerobotics  Research 
Laboratory,  which  consists  of  a  dexterous  7-DOF  RRC  K- 
1207  redundant  manipulator  and  controller,  a  6-DOF  Lord 
wrist  force/torque  sensor,  a  TRI  servoed  gripper,  a  VME- 
based  real-time  computing  environment  with  six  Motorola 
68020  processor  boards,  a  Sun  4/260  workstation,  and 
a  Silicon  Graphics  IRIS  workstation.  This  laboratory  is 
designed  to  be  a  general  space  telerobotics  research  and 
development  facility.  One  of  the  consequences  of  this  fo¬ 
cus  is  that,  in  its  present  configuration,  the  update  rate  for 
any  outer-loop  control  strategy  that  commands  task  space 
position  setpoints  can  be  no  greater  than  35  Hz.  Note  that 
this  slow  update  rate  presents  a  severe  challenge  to  any 
adaptive  outer-loop  control  algorithm. 

The  adaptive  admittance  control  scheme  (12),  (14) 
is  implemented  in  the  present  experimental  study.  The 
task  specified  in  the  experiment  requires  the  manipulator 
to  approach  the  environment,  accommodate  the  transi¬ 
tion  from  unconstrained  to  constrained  motion,  and  then 
exert  a  user-defined  contact  force  on  the  environment. 

Two  different  environments  are  utilized  in  the  experi¬ 
ments:  a  “soft”  environment  consisting  of  books  stacked 
on  a  cardboard  box,  and  a  “stiff’  environment  obtained 
by  mounting  a  metal  plate  on  a  steel  table.  No  a  priori 
knowledge  of  the  manipulator  model,  the  inner-loop  po¬ 
sition  controller,  or  the  environment  location  or  stiffness 
is  provided  to  the  outer-loop  admittance  control  strategy. 

In  each  experiment  the  reaction  surface  is  placed  in  the 
robot  workspace  so  that  it  is  oriented  normal  to  the  y 
axis  and  is  located  approximately  0.25  m  from  the  initial 
end-effector  position.  The  manipulator  is  commanded  to 
exert  a  contact  force  of  10.0N  on  the  reaction  surface  and 
to  maintain  its  initial  end-effector  x  and  z  positions  and 
orientation;  note  that  with  this  task  definition  the  manip¬ 
ulator  approaches  the  reaction  surface  under  the  control 
of  the  admittance  control  scheme  and  with  the  objective 
of  reducing  the  contact  force  error.  The  available  manip¬ 
ulator  redundancy  is  utilized  to  reduce  the  impact  force 
using  the  strategy  summarized  in  Section  3.2. 

The  experiments  are  designed  to  evaluate  the  effec¬ 
tiveness  of  the  admittance  controller  (12),  (14)  for  estab¬ 
lishing  stable  contact  and  for  achieving  accurate  force 
regulation  in  the  presence  of  considerable  uncertainty  re¬ 
garding  the  manipulator,  inner-loop  position  controller, 
and  environment.  In  addition,  these  experiments  exam¬ 
ine  the  sensitivity  of  the  controller  to  a  large  variation  in 
environmental  stiffness.  For  all  experiments,  the  adap¬ 
tive  gains  f,  K  are  initialized  to  zero,  e  is  set  equal  to 
1,  and  the  adaptation  gains  are  chosen  to  be  —  0.01 


and  5:  =  0.0001  (for  all  gains,  the  units  for  force  and 
length  are  Newton  and  centimeter).  The  results  of  these 
experiments  are  given  in  Figures  3  and  4.  with  Figure  3 
showing  the  response  of  the  contact  force  for  the  soft 
environment  and  Figure  4  illustrating  the  response  for 
the  stiff  environment.  It  can  be  seen  from  the  figures  that 
in  both  cases  stable  contact  is  rapidly  established,  and 
accurate  force  set  point  regulation  is  quickly  achieved, 
despite  the  lack  of  a  priori  information  and  the  very  slou 
update  rate  (35  Hz).  In  addition,  it  is  observed  that  the 
performance  is  robust  to  large  variations  in  environmental 
characteristics  (cardboard  box  and  steel  plate).  It  should 
be  noted  that  in  the  present  experimental  setup,  gravity 
loading  on  the  force/torque  sensor  is  only  approximately 
compensated  with  calibration,  so  that  forces  with  mag¬ 
nitudes  less  than  2.0  N  are  ignored  in  the  software:  this 
accounts  for  the  offset  that  appears  in  the  plots  of  contact 
force  before  contact  with  the  environment  is  made. 


5.  Conclusions 

This  article  presents  two  new  adaptive  compliant  motion 
control  schemes  for  rigid-link  manipulators.  The  effec¬ 
tiveness  of  the  proposed  schemes  is  illustrated  through 
both  computer  simulations  and  laboratory  experiments 
with  a  dexterous  RRC  7-DOF  manipulator.  These  studies 
indicate  that  the  adaptive  impedance  controller  provides 
an  accurate  and  robust  means  of  achieving  the  desired 
end-effector  impedance  characteristics  and  that  the 
adaptive  admittance  scheme  represents  a  readily  imple- 
mentable  method  of  obtaining  high  performance  force 
control.  Furthermore,  the  investigations  demonstrate  that 
manipulator  redundancy  can  be  used  to  improve  the  per¬ 
formance  of  compliance  control  tasks  involving  abrupt 
transition  from  unconstrained  to  constrained  end-effector 
motion  by  reducing  the  impact  force. 


Appendix  A:  Proof  of  Theorem  1 

In  this  appendix  the  proof  of  Theorem  1  is  sketched; 
a  detailed  proof  of  the  theorem  is  given  in  Colbaugh 
et  al.  (1993a).  Consider  the  Lyapunov  function  candidate 


v  =  ^qT#q  +  kXeTe  +  f 


(Al  j 


1 

+  2* 


and  note  that  V  is  a  positive  definite  and  radially  un¬ 
bounded  function  of  the  closed-loop  system  state.  Com¬ 
puting  the  derivative  of  (Al)  along  (7),  (S »  and  simplify¬ 
ing  using  Lemma  1  yields 
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Fig.  3.  End-effector / environment  contact  force  in  experimental  study  (soft  environment). 


Fig.  4.  End-effector /  environment  contact  force  in  experimental  study  (stiff  environment). 
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where  ||  •  \\p  is  the  Frobenius  norm  and  Vcc  =  V^cCx.x^). 
The  properties  of  HyVCCyG  summarized  in  Lemma  1. 
together  with  the  boundedness  of  xr,  xr,  and  xr,  permit 
the  following  bound  on  V  in  (A2)  to  be  derived: 


V  < 
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11 2  k\2 
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4  ft  11  -4  11 F  4ft 
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(A3) 


where  dmax  is  an  upper  bound  on  d*  ||  and  the  <5.  are 
positive  scalar  constants  that  do  not  increase  as  the  i\ 
increase.  In  obtaining  these  bounds  the  compactness  of 
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N  and  continuity  of  hG  are  utilized  to  show  that  x  and  P 
are  bounded,  and  therefore  xd,  xd,  and  x^  are  bounded. 
Then,  using  Lemma  1 ,  it  can  be  concluded  that  I4c(x,  x^) 
is  bounded  and  grows  linearly  in  e.  Arguments 

of  this  type  concerning  the  implications  of  a  compact 
configuration  manifold  on  the  possible  magnitude  of 
the  configuration  error  have  also  been  used  in  Lozano 
and  Brogliato  (1992),  Wen  et  al.  (1992),  and  Colbaugh 
et  al.  (1994). 

z  =  [||  e  ||  ||  q  ||f,  *  =  [||  3>j  ||  ||  *A  ||F 
II  $B  ||f  II  $k  ||fJt,  Pm'm  =  min(/?i),  and  /3max  = 
ma x(ft).  If  ftmax / fim\n  is  fixed,  then  there  exist  positive 
constants  A*  independent  of  the  ft,  and  positive  constants 
ft  that  do  not  increase  as  fim\n  increases,  such  that  V  and 
V  in  (Al)  and  (A3)  can  be  bounded  as 


V  =  -k2  ||  if  II2  -eiti  ;  e;  ir  -eijHif  (B2 

+  ce/[V«e/  -  k2ef]  +  1  oro 
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a,  II  Z  II2  +£-  II  *  ||2<  V  <  A3  II  z  II2  +£-  II  #  II2 

Pm  in  Pm  in 

V'  <  -AS  ||  z  ||2  ||  *  ||2  +  J-S,  +dmaxJB2 

Pmin  Pmin 

Lemma  2  then  permits  the  conclusion  that  ||  z  ||, 

||  4>  ||  are  uniformly  bounded,  which  implies  that  e,  q, 
3>/, are  uniformly  bounded.  Moreover,  in 
the  case  of  no  external  disturbances  or  force  measurement 
errors  (so  that  dmax  =  0),  ||  z  ||  and  ||  3>  ||  converge 
exponentially  to  the  closed  balls  ft,,  BT2 ,  respectively, 
where  r,  =  ((5B,/A,/?min),/2,  r2  =  (<5£(/A2)l/2,  and 
8  =  max(A3/A5,  A4/A6).  Observe  that  the  radius  of  the 
ball  to  which  ||  z  ||2=||  e  ||2  +  ||  q  ||2  is  guaranteed  to 
converge  can  be  decreased  as  desired  simply  by  increas¬ 
ing  ftnin* 


Appendix  B:  Proof  of  Theorem  2 

In  this  appendix,  the  proof  of  Theorem  2  is  summa¬ 
rized;  additional  details  can  be  found  in  Colbaugh 
et  al.  (1993a).  In  what  follows,  let  <j>  —  <£]T]r 

and  <px  =  block  diag [4>kp,  <pkc]  for  notational  conve¬ 
nience.  Consider  the  Lyapunov  function  candidate 


V  =  -ejHif  +  -M/e/  +  eejHif 


l  /p  1 

+  IPiktka"  +  2/?2A|Ae, 


M<Pk4>k] 


and  observe  that  V  is  positive  definite  and  radially  un¬ 
bounded  in  all  arguments,  provided  €  are  positive 

scalar  constants  and  e  is  chosen  so  that  the  inequality  (15) 
is  satisfied.  Computing  the  derivative  of  (BI)  along  (13), 
(14)  and  simplifying  using  Lemma  1  yields 


where  the  ft  are  positive  scalar  constants  that  do  not 
increase  as  ft,  ft  increase,  the  matrix  0  is  given  by 

q  _  €^i 

.  5^^-  ^ f  Am2X(if )  -  <f€maxA.vr 

and  the  upper  bound  (B2)  was  obtained  using  the  proce¬ 
dure  adopted  in  deriving  the  bound  (A3i  in  the  proof  of 
Theorem  1.  Note  that  if  e  is  chosen  so  that  (15)  is  satis¬ 
fied,  then  Q  is  positive  definite.  Let  z  =  (  tj  ||  ||  e/  ||]r. 
^  =  (II  <P  II  II  <Pk  I.r]r,  -imin  =  min(ft,ft).  and 
finvdx  =  max(ft  ,ft).  If  imax/^min  is  fixed,  then  there  exist 
positive  scalar  constants  A*  independent  of  ft, ft.  and 
positive  constants  ft.  ft  that  do  not  increase  with  ft ,  ft. 
such  that  V  and  V  in  (BI)  and  (B2)  can  be  bounded  as 

a.  II  Z  II2  +^~  II  *  r<  V  <  A3  II  z  2  -2i.  I;  II2 

Omin  ftriin 

V  <  -a5  ||  z  ||2  -  A.  ||  <i>  !|2  +-Lb3  +  dTmBA 

'-'min  Pmsn 

Lemma  2  then  permits  the  conclusion  thai  ,|  z  ||,  |j  || 
are  uniformly  bounded,  which  implies  thai  e/,  e/.  <p, 

<f>K  ^  uniformly  bounded.  Moreover,  in  the  case  of  no 
external  disturbances  (so  that  d ^  =0).  z  ||  and 
||  <i>  ||  converge  exponentially  to  the  closed  balls  ftt, 
ft2,  respectively,  where  —  (dft/Ai r2  = 
(<5ft/A2)!^2,  and  d  =  max(A3/As,  A4/AAL  Observe  that 
the  radius  of  the  ball  to  which  ||  z  ||2=|.  e.*  ||2  +  jj  e/  ||2 
is  guaranteed  to  converge  can  be  decreased  as  desired 
simply  by  increasing  Jmin. 
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By  using  an  integration  by  parts,  the  coordinates  x (t)  and  y(t)  as 
functions  of  6(t)  are 

x(t)  =  ka(cos$  —  cosOo  +  (0  —  0o  + 

-^sin0o)  +  xo 

y(£)  =  &a(sin0  —  sin0o  —  (0  —  Bo  +  ^iL)cos8  . 

+^co se0)  +  yo 

0(t)  =  U>ot  +  0O 

When  vo  <  0,  there  is  a  point  ->(f )  such  that  v(t)  ~  0.  This  is  a 
cusp.  The  corresponding  configuration  c  verifies 


Xc  —  ha(cos$t  -  cos0o  +  (0c  —  0o)sin0o)  +  xo  (11) 

ya  =  Jta(sin  0*  —  sin  0o  —  (0c  —  0o)  cos  8o)  -f  yo  (12) 

;^=_f222.+tfo;:.  as) 

a  ...  ;  . , 
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for  Redundant  Manipulators 
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Abstract-Tins  papa:  presents  a  simple  and  robust  approach  to  achiev¬ 
ing  collision  avoidance  for  kinematically  redundant  manipulators  at  the 
control-loop  level.  The  proposed  scheme  represents  the  obstacle  avoidance 
requirement  as  inequality  constraints  in  the  manipulator  workspace,  and 
ensures  that  these  inequalities  are  satisfied  while  the  end-effector  tracks 
the  desired  trajectory.  The  control  scheme  is  the  damped-least-squres 
formulation  of  the  configuration  control  approach  implemented  as  a 
kinematic  controller.  Computer  simulation  and  experimental  results  are 
given  for  a  Robotics  Research  ?  DOF  rednndant  arm  and  demonstrate 
the  collision  avoidance  capability  for  reaching  inside  a  truss  structure. 
These  results  confirm  that  the  proposed  approach  provides  a  staple  and 


effective  method  for  real-time  collision  avoidance. 


I.'  Introduction 

In  Oct  1990,  the  National  Aeronautics  and  Space  Administration 
(NASA)  initiated  a  research  and  development  project  on  Remote 
Surface  Inspection  (RSI)  at  the  Jet  Propulsion  Laboratory  (JPL). 
The  goal  of  this  project  is  to  develop  and  demonstrate  the  necessary 
technologies  for  the  inspection  of  space  structures,  such  as  the  Space 
Station  Freedom  (SSF),  using  remote  robots  for  sensor  placement 
under  supervisory  controL  The  purpose  of  die  inspection  is  to  monitor 
the  health  of  the  structure  and  assess  possible  damage  by  employing 
a  number  of  sensing  devices  deemed  appropriate  for  the  task.  Fig.  1 
shows  the  Surface  Inspection  Laboratory  which  emulates  the  task 
environment  where  the  inspection  will  be  performed.  Tins  figure 
shows  a  one-third  scale  mock-up  of  part  of  the  truss  structure  of  SSF, 
as  well  as  a  seven  degree-of-freedom  (DOF)  Robotics  Research  Cor¬ 
poration  (RRQ  manipulator  that  carries  the  inspection  sensors  at  its 
end-effector.  Complete  inspection  of  the  truss  structure  and  die  orbital 
replacement  units  mounted  on  it  requires  the  capability  to  reach  safely 
into  the  truss  openings  and  to  maneuver  in  constricted  workspaces 
inside  the  truss.  Therefore,  the  ability  to  perform  automated  collision- 
free  motions  in  a  congested  environment  is  a  fundamental  requirement 
for  the  project 

The  robotic  arm  used  in  the  project  is  kinematically  redundant 
since  it  possesses  more  DOF  than  are  required  to  achieve  the  deshed 
position  and  mentation  of  the  end-effector.  One  of  the  advantages  of 
robot  redundancy  is  the  potential  to  use  die  “extra”  DOF  to  maneuver 
in  a  congested  workspace  and  avoid  collisions  with  obstacles.  Much 
of  the  work  reported  to  date  concerning  collision  avoidance  for 
manipulators  has  dealt  with  high-level  path  planning ,  in  which  die 
end-effector  path  is  planned  off-line  so  as  to  avoid  collision  with 
workspace  obstacles  (see,  for  example,  the  recent  text  [1]  and  the 
references  therein).  Alternatively,  the  obstacle  avoidance  problem  can 
be  solved  on-line  by  the  robot  controller  at  the  low  leveL  Previous 
approaches  to  on-line  obstacle  avoidance  for  redundant  manipulators 
have  focused  cm  die  inverse  kinematics  portion  of  the  robot  control 
problem  (e.g.,  [2}-[7]),  although  Khatib  [8]  and  Colbaugh  et  <d. 
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Fig.  1.  Surface  inspection  laboratory  at  JPL. 


[9]  have  proposed  complete  control  systems  for  redundant  robots 
maneuvering  in  the  presence  of  obstacles. 

In  this  paper,  attention  is  focused  on  the  problem  of  controlling  a 
redundant  robot  in  real-time  so  that  it  closely  tracks  the  desired  end- 
effector  trajectory  and  simultaneously  avoids  workspace  obstacles. 
The  proposed  approach  is  to  represent  the  collision  avoidance  require¬ 
ment  as  a  set  of  kinematic  inequality  constraints,  and  then  ensure  that 
these  constraints  are  satisfied  during  the  desired  end-effector  motion. 
The  scheme  is  implemented  at  the  differential  inverse  kinematic 
level  using  a  damped-least-squares  implementation  of  configuration 
control  [10],  [11];  thus  it  is  assumed  that  a  joint-space  position 
controller  is  available.  The  simplicity  and  computational  efficiency  of 
the  proposed  approach  allows  obstacle  avoidance  to  be  implemented 
in  a  dynamically  varying  environment  in  real-time.  The  efficacy  of  the 
scheme  is  illustrated  through  a  comprehensive  computer  simulation 
and  experimental  study.1 

The  paper  is  organized  as  follows.  Section  II  presents  the  collision 
avoidance  scheme  and  provides  a  brief  summary  of  various  imple¬ 
mentation  issues.  The  collision  avoidance  strategy  is  illustrated  in 
Section  m  through  a  computer  simulation  study  and  in  Section  IV 
through  experiments  with  a  Robotics  Research  7  DOF  arm.  Finally, 
Section  V  summarizes  the  paper  and  draws  some  conclusions. 

n.  Collision  Avoidance  Strategy 

Consider  the  problem  of  maneuvering  a  kinematically  redundant 
manipulator  in  a  congested  workspace  in  such  a  way  that  the  desired 
end-effector  trajectory  is  closely  tracked  and,  at  the  same  time, 
collisions  between  the  manipulator  links  and  workspace  obstacles 
are  avoided.  It  is  assumed  that  the  robot  has  low-level  joint  servo- 
loops  capable  of  ensuring  that  the  robot  joint-space  position  0  €  ft n 
closely  tracks  any  desired  trajectory  #,*(£).  Thus  the  above  problem 
becomes  one  of  kinematic  control ,  and  involves  specifying  0d(t)  so 
that  the  end-effector  trajectory  tracking/collision  avoidance  objectives 
are  accomplished  simultaneously. 

A  Overview  of  Control  Scheme 

Let  x  6  ftm  denote  the  position/orientation  of  the  manipulator  end- 
effector,  and  recall  that  m  <  n  because  the  robot  is  kinematically 
redundant  The  forward  kinematics  and  differential  kinematics  for 

1  Recently  another  approach  to  combining  the  strategies  [9],  [1 1]  to  achieve 
inverse  kinematics-based  collision  avoidance  has  been  proposed  in  [12].  Thi2  * 
work  adopts  a  sensor-based  approach  and  is  implemented  and  verified  through 
computer  simulation. 


the  manipulator  can  be  written  as 

X 

1! 

(i) 

x  =  jc(6)e 

(2) 

where  f  :  ftn  —  ftm,  and  Je  G  ftmXn  is  the  end-effector 
Jacobian  matrix.  Our  approach  to  kinematic  control  is  to  utilize  the 
improved  configuration  control  method  described  in  [11].  Briefly,  this 
method  involves  augmenting  the  end-effector  task  with  additional 
user-specified  tasks  and  then  inverting  the  augmented  differential 
kinematics  using  a  damped-least-squares  (DLS)  solution.  Thus  the 
user  defines  the  kinematic  functions 

y  =  g(*)  (3) 

where  g  :  ft"  — ►  3T  and  r  is  the  number  of  kinematic  fractious  to 
be  controlled  as  the  additional  task.  Augmenting  (1)  with  (3)  yields 
the  kinematic  model 


and  the  differential  kinematic  model 

4 =[£<'!]* <5> 

where  z  G  ft(r+m)  is  the  configuration  vector  and  Jc  G  ftrx*  is  the 
Jacobian  matrix  associated  with  the  additional  task. 

The  desired  behavior  of  the  manipulator  can  now  be  specified  by 
defining  the  desired  configuration  vector  zd(t)  =  [xj(* )|yj(f)]  . 
The  DLS  approach  to  computing  the  desired  joint-space  trajectory 
corresponding  to  zd  is  to  compute  6d  from  [1 1] 

ed  =  (Jrny + A2H'v]_1  JTW[zd  +  A'e]  (6) 

and  then  integrate  it  to  obtain  6d.  In  (6),  e  =  zd  -  z  is  the 
tracking  error,  K  €  »*'+"> »«r+m)  ^  a  symmetric  positive-definite 
position  feedback  gain  that  corrects  for  linearization  error  and  ocher 
unmodeled  effects,  and  the  relative  importance  of  dose  trajectory 
tracking  and  low  joint  velocities  is  reflected  in  die  choice  of  die  sym¬ 
metric  positive-definite  weighting  matrices  H'G  (**+*)  and 

Wv  €  ftnXn  and  the  positive  scalar  constant  A.  The  DLS  fonnobooQ 
of  the  configuration  control  scheme  gives  optimal  solutions  to  (5)  that 
are  robust  to  singularities.  This  is  achieved  by  optimally  reducing  the 
joint  velocities  to  induce  minimal  errors  in  the  task  performance  by 
modifying  the  task  trajectories.  Tins  approach  to  kinematic  control  is 
discussed  in  (e.g.,  [1 1],  [13]-[17]),  and  various  properties  and  feafimes 
of  this  scheme  are  presented  in  these  references.  A  few  nAftwni 
facts  concerning  DLS  solutions  that  are  useful  in  our  subsequent 
development  are  summarized  here;  die  derivation  of  these  resides  is 
given  in  the  Appendix. 

first  note  that  the  DLS  formulation  (6)  can  be  written  a  the 
following  equivalent  form  (see  Appendix  A!) 

ed  =  W~1JT[JW~1JT  +  \2W~l]~\zd  +  A'e)  (7) 

The  formulation  (7)  possesses  certain  advantages  over  (6)  wfen 
(r  -f  m)  <  n,  including  enhanced  computational  efficiency  ad 
increased  flexibility  concerning  the  selection  of  W,  WV,  and  A  (for 
example,  A  can  be  set  to  zero,  corresponding  to  die  specification  fet 
the  desired  trajectory  is  to  be  tracked  exactly).  Next  observe  th*  the 
matrices  W  and  Wv  can  be  chosen  to  “scale’’  the  componeas  of 
flgnd  6  and  thereby  provide  a  natural  way  to  handle  diffiedties 
associated  with  mixed  units  (e.g.,  meters  and  degrees),  and  *i<n 
different  emphasis  in  performance  requirements  (e.g.,  high  accancv 
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in  position  tracking  and  low  accuracy  in  orientation  tracking).  For 
example,  letting 

IF  —  diag[(max.  allowed  velocity  error)'2]  =  W1/2W1/2 
Wv  =  diagf(max.  allowed  joint  rate)"2]  =  IFj^PFj^2  (8) 

leads  naturally  to  the  definition  of  the  “normalized”  joint-space  and 
task-space  variables  9d  =  Wl/20d  and  z*  =  W1/2(z<i  +  A'e). 
Note  that  these  quantities  arc  dimensionless  in  view  of  (8),  so  that 
mixed  units  arc  no  longer  a  problem,  and  that  the  definitions  also 
peomt  convenient  incorporation  of  different  emphasis  in  performance 
re*p*rcments  (since,  for  example,  each  component  of  the  task-space 
is  now  scaled  by  the  maximum  permissible  error).  These  definitions, 
together  with  J~  —  JF1/2  JIFT1/2,  permit  (6)  and  (7)  to  be  written  as 

TO,  \*I)~lrTZi  •;  .  (9) 

i.  =rT(rrT +xti)~li-J  ■■'■■■■  ao) 

where  A  reflects  the  relative  significance  of  tracking  errors  and  joint 
velocities.  Note  that  the  definitions  (8)  can  also  reduce  the  problem  of 
selecting  W9  lFr-  K  to  one  of  selecting  the  two  scalars  A  and  k  (since 
the  weighting  on  the  elements  of  e  are  included  in  Wy  it  suffices  to 
use  K  =  klX  The  task  of  specifying  these  parameters  can  be  guided 
by  consideration  of  the  following  two  bounds,  which  are  derived  in 
Appendix  AJ2z 

'  -> 
11*11  <-£  II  *<f  +  *e||  (12) 

where  a min  is  the  minimum  singular  value  of  J.  Note  that,  as 
expected,  ||  e  |  can  be  reduced  by  increasing  k  and/or  decreasing 
A,  and  that  these  adjustments  tend  to  increase  ||  6  ||.  Thus  tracking 
error  requirements  and  joint  rate  limits  can  be  balanced  through  the 
selection  of  k  and  A.  It  should  be  mentioned  that  both  k  and  A  can 
be  accosted  on-line  based  on  the  observed  performance  of  the  robot 
(e.g^  (II],  [14]),  This  typically  leads  to  improved  performance,  and 
if  this  strategy  is  adopted  the  analysis  summarized  above  can  be  used 
to  select  initial  values  or  bounds  for  these  parameters. 


e5<  defined  as  follows:  eSi  =  0  if  gi  >  0,  and  e„i  =  — <7,  if  <  0. 
Then  the  DLS  formulation  (6)  can  be  written  as 

[JTWJ  +  WV+  Ji)0d  =  JTW(id  +  Kef+'£iwij7  e,x 


where  A  ==  1,  —  dg%  f  89  G  3?1  x  n ,  and  the  wi  are  noraoegafive 

scalar  weights  whose  magnitudes  indicate  the  relative  impoctaarc  of 
inequality  gi  >  0.  Note  that  in  the  current  collision  avoidance  scheme 
z^  .  is  set  to  0,  so  that  here  we  are  interested  in  regulating  a  collision 
avoidance  task,  not  tracking  a  desired  evolution.  The  implementation 
(14)  is  illuminating  because  it  makes  clear  how  the  inequalities  (13) 
can  be  introduced  and  then  removed  as  the  task  evofresvja  addition, 
this  formulation  leads  to  a  modular  algorith^^ 
however,  that  the  DLS  scheme.  (7)  can  be  modified  ia^antiogous 
manner,si^taddmgthe‘^ 

to  augmenting  theJac^ian'm^^  .the:.‘ltaS^^ibfc;  cniityector 
when  the  constrains  are  “active”  "and  then  removing^&ese^ terms 
when  the  constraints  are  “inactive  ”  As  indicated  above,  (7)  possesses 
computational  and  implementation  advantages  over  the  more  standard 
result  (6).  These  advantages  are  typically  significant  in  obstacle 
avoidance  applications,  since  in  these  applications  it  is  ushrfy  the 
case  that  (m-fr+p)  <  n  (recall  that  m  is  the  dimension  of  the  task- 
space,  for  example  the  number  of  end-effector  position/oriantation 
coordinates  to  be  controlled,  p  is  the  number  of  collision  avoidance 
constraints  currently  active,  and  r  is  the  number  of  other  additional 
tasks):  *  *  :j.  •  •  •  ■  v  •• 

Finally,  note  that  although  activation  and  deactivation  of  con¬ 
straints  tends  to  be  “damped”  by  the  DLS  scheme,  so  that  abrupt 
changes  in  9d  are  not  observed,  this  activation  and  deactivation  can 
be  smoothed  and  the  overall  performance  can  be  enhanced  by  defining 
the  weights  wi  to  be  smooth  functions  of  the  errors  e9i  as  follows: 


L[1  -  cosfl’(e,j/em0i)]  0  <  e#i  < 

r  e#i  >  emar 


with  Wmax,  emax  selected  based  on  the  particular  application  and 
overall  system  geometry  (note  that  emax  is  not  the  maximum 
permissible  error,  but  rather  the  error  magnitude  beyond  which  the 
weight  Wi  is  set  to  its  maximum  value). 


B.  Smtmary  cf  Collision  Avoidance  Scheme 

The  real-time  collision  avoidance  strategy  described  in  this  paper 
is  implemented  within  the  DLS  configuration  control  framework 
summarized  in  Section  Q-A  and  consists  of  two  steps.  The  first  step 
ntilggs  an  efficient  recursive  algorithm  to  formulate  the  requirement 
that  wo  robot  link  should  contact  any  workspace  obstacle.  This 
rapHement  can  be  expressed  as  a  set  of  task-space  inequality 
constraints  of  the  form 

gW  >  0  (13) 

where  g  :  3£n  — ►  concisely  quantifies  the  collision  avoidance 


C.  Geometrical  Calculations 

In  order  to  realize  the  objective  of  real-time  collision  avoidance 
using  modest  computing  power,  we  utilize  geometrically  simple 
models  for  objects  in  the  workspace  and  few:  the  manipulator  itself, 
and  adopt  an  efficient  recursive  strategy  for  computing  the  necessary 
geometrical  relationships  between  these  object  models  and  the  ma¬ 
nipulator  model.  Thus,  while  we  use  the  full  robot  kinematic  model 
for  control  purposes,  we  shall  approximate  the  RRC  arm  geometry 
with  three  line  segments  for  all  collision  avoidance  calculations;  these 
segments  are  the  “fictitious  links”  connecting  frames  {1},  {3}*  {5}, 
and  {T},  where  { T }  is  the  user-defined  tool  frame  (see  Fig.  2(a)). 
Examination  of  the  actual  kinematics  of  the  RRC  arm 


requirement  Constructing  these  inequalities  for  a  general  manipulator  that  this  simple  model  represents  a  reasonable  compromise  between 
moving  in  a  workspace  cluttered  with  convex  obstacles  contained  geometrical  fidelity  and  computational  simplicity.  This  model  implic- 
in  spheres  is  described  in  [9].  An  efficient  method  for  computing  itly  assumes  the  existence  of  sufficient  “buffer”  regions  surrounding 
these  inequalities  for  anthropomorphic  robots  maneuvering  through  all  obstacles,  to  account  for  the  physical  dimensions  of  the  robot 
openings  is  presented  in  Section  II-C.  Thus  the  buffer  thickness  is  defined  to  be  the  maximum  discrepancy 

i-  The  second  step  in  the  collision  avoidance  scheme  is  to  ensure  between  the  physical  dimension  of  the  robot  and  the  sm^fified 
satisfaction  of  (13)  while  tracking  the  desired  end-effector  trajectory,  “stick-figure”  robot  (see  Fig.  2(a)),  and  this  buffer  thickness  is 
This  objective  is  achieved  by  slightly  modifying  the  DLS  confi^  “  ^orporated  into  the  model  for  all  workspace  obstacles.  In  view  of 
uratioa  control  strategy  to  include  inequality  constraints.  Observe  the  nature  of  this  construction,  it  is  recommended  that  the  task  be 


that  the  inequality  constraint  (13)  can  be  incorporated  into  the  DLS  aborted  if  the  constraint  error  eSi  ever  becomes  so  large  that  this 
configuration  control  framework  by  introducing  the  constraint  error  buffer  region  is  violated  In  what  follows,  four  classes  of  obstacles 
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Fig.  3.  Collision  avoidance  parameter  descriptions. 


^  (®)  Robotics  Research  Corporation  K-1207  redundant  manipulator 
approximation,  (b)  Collision  avoidance  task  description.  ■  ; 


are  considered:  spheres, -drenlar  ripenings:  triangnlar^openingc  and 
rectanguIar?openings.  Note  that  byappropriately  selecting  dimencionc 
and  combining  these  “object  primitives  ”  a  large  class  of  workspace 
objects  can  be  modeled  with  reasonable  accuracy. 

The  results  summarized  in  the  previous  two  sections  give  a 
means  of  simultaneously  tracking  the  desired  end-effector  trajectory 
and  avoiding;  arm  Collisions  with  workspace  obstacles  provided 
that  the  terms  eJ(,  J,-  (i  =  1, 2,...,p)  in  (14)  can  be  computed 
efficiently.  Algorithms  for  The  t^culatidn  <rf  quantitiw  for 
Clerical  obstacles  are  given  in  [9].  The  remainder  of  this  section 
is  focused  oncalculatingthese  tennsfortheRRC  imanipniatnr 
maneuvering  through  circular;  triahgular.'aM  rectahgnlar  op 
Consider  the  schematic-rifthc  TtRC  arm  maneuvering  through  a 
circular  opening  in  a  planar  structure  shown  in  Tig  2(b).  We 
assume  Oat  the  RRC  arm  forward  kinematics  is  available  in  the 
form  of  homogeneous  transforms  ?r(0)-Vt ;  wbere^O)  is  the  user- 
defined  worid  frame -‘Thus  tiie  positi^'<rf!ihe  arighs'ef  frames 
{!}» {3},  {5},  {r}  arc  known;  and  -we  denote  these  ’positions  by 
0xu0x„0  x*,0x4  €  u.v.  MjvpJT*  «*.  ' 

For  clarity  of  exposition,  we  idealize  the  probkm  of  maneuvering 
through  an  opening  with  a  robot  by  considering  an  infinite  planar 
surface  widi  one  opening  possessing  the  shape  of  a  circle,  a  triangle, 
OT  a  rectangle.  Note  that  this  problem  is  nrfiiaRy  fairly  general  and 
thaffts  solution  can  readily'be  extended  to  other  opening  geometries 

sin^e  opening  is  conservative  in  that,  for  instance,  ft  does  not  permit 
an  “elbow”  to  back  into  another,  fortuitously  located,  opening).  In  the 
case  of  a  circular  opening,  we  assume  the  geometry  is  parameterized 
witii  the  triple  °XoP,r,°  n,  where  9XoP  6  ft3  locates  the  center  of 
tire  opening,  r  =  [r0  r,]T  £  3?2  gives,  the  opening  outer  and  inner. 

radii,  and  °n  €  Sf3  is  the  unit  vector  normal  to  the  surface  on  the  {0}  2 

side  of  the  planar  surface.  For  triangular  or  rectangular  openings,  it 
is  assumed  that  the  vertices  of  the  polygon  are  known.  In  these  cases. 


the  vertices  can  be  readily  utilized  to  compute  °E  and  one  vertex  can 
be  (arbitrarily)  identified  as  °x<,p  [18]. 

Our  objective  is  to  ensure  that  the  robot  tracks  the  desired  cad- 
effector  trajectory  while  avoiding  collisions  with  either  the  [J«»» 
surface  or  the  sides  of  the  opening.  We  consider  the  problem  of 
avoiding  contact  with  the  planar  surface  first  A  consequence  of 
^  assuming  “large”  surfaces  is  that  the  set  of  points  with  the  paternal 
v  *or  colliding  with  the  surface  is  reduced  to  °Xi  ,0x2,oX3,°x,.  Wc 
-  define  ’  . 

di  =°  nr(°x,  -°  XoP)  t  =  1,2,3, 4  06) 

to  be  the  (signed)  distances  from  die  planar  surface  to  the  °xi,  and 
note  that  d,-  >  0  indicates  that  °x^  is  on  the  {0}  side  of  the  plane;kis 
assumed  that  initially  d,  >  0  Vi.  Our  approach  to avoiding  cdfiaons 
with  the  planar  surface  is  to  monitor  die  d,  as  die  task  evolves,  and 
aUow  this  infonnation  to  dictate  subsequent  action.  For  dcampfc,  if 
di  <  0  and  dt+1  >  0  then  it  is  assumed  that  °x,:  is effiding  wih 
the  surface  and  is  not  attempting  io  maneuver  through  (be  npr^' 
In  this  case,  eMi  =  — d,,  °fi  =°  n,v  and  die  scheme  developed 
for  spherical  obstacles  Is  direedy  relevant  here  [9].  Ahemathefy, 
if  d,*  >  0  and  d^i  <  0,  it  is  assumed  that  an  Mtempr  is  ^de 
to  maneuver  through  the  opening,  and  die  strategy  developed  bdkm 
for  avoiding  collisions  with  the  opening  is  invoked.  lmpfidt  ia  tts 
procedure  is  the  assumption  that  °X4  (dial  is,"{ir})  is  being  dSrectcd 
in  a  reasonably  well-informed  manner.  Note  that  in  the  ideal  case 
in  which  die  above  assumptions  (zero  thickness  links,  openings  m 
planar  surfaces,  well-informed  end-effector  motion)  are  satisfied,  fc 
ptx^osed  strategy  will  always  detect  collision  when  it  is  eminent.  la 
practical  cases,  this  approach  permits  the  development  of  a 
real-time,  control-level  approach  to  collision  avoidance  which  vl 
work  for  a  large  dass  erf  obstacles.  V=  > 

Our  approach  to  avokfing  coflisionsbetweca  tfteann  andthesfc 
of  die  opening  is  to  utilize  die  d,-  computed  in  (16)  to  identify  a 
critical  link.  More  specifically,  if  dt  >  0  and  di+I  <  0  foraay 
*,  then  die  link  i  connecting  °x,  and  °x,+1  is  a  critical  link  mi 
potentially  contains  a  critical  point  (see  fig.  3  for  an  fflustrfioo 
°Xc;  s  precise  definition  of  this  term  is  given  in 
(17)  below,  and  additional  details  regarding  die  nature  and  role  of 
critical  points  in  collision  avoidance  are  provided  in  [9D_  Thus  a  U 
contains  a  critical  point  candidate  once  it  has  intersected  die  plane  of 
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fi£_  4_  Graphical  depiction  of  RRC  arm  and  truss  structure. 


the  opening.  Given  the  assumption  that  the  obstacle  parameterizations 
are  known,  we  desire  to  compute  the  following  quantities  for  each 
link  the  location  of  the  critical  point  candidate  xCi  £  9ft3  for  the  link 
t\  the  distance  dCi  £  9ft+  between  x^  and  Xo?,  and  the  unit  vector 
nf  E  R3  pointing  from  xCi  to  XoP  (see  Fig.  3).  These  quantities 
can  be  computed  recursively  starting  from  the  robot  base.  Note  that 
if  more  than  one  link  is  critical  it  is  recommended  that  the  task  be 
aborted. 

Q»ce  a  critical  link  has  been  identified,  it  is  necessary  to  determine 
the  terms  e3i  and  Jt  required  by  the  control  scheme.  Because  the 
nature  of  these  calculations  varies  with  the  geometry  of  the  opening, 
we  consider  the  three  types  of  openings  separately. 

Grcnlar  Opening 

Here  the  location  of  the  critical  point  candidate  is 

°xCi  =°  x,  +  a°e,  (17) 

where  °et  =  (°xt+i  -°  xt )/  ||°  x1+1  -°  xt  || 

a  =  -di/°eT°  fi 

Then  the  critical  distance  is  dCi  =  rt—  ||°  Xct  — 0  XoP  ||,  the  critical 
point  candidate  is  a  critical  point  if  dCi  <  0,  and  if  the  point  is 
critical  then  eSi  =  —  dCi .  The  corresponding  row  of  the  Jacobian  for 
this  point  can  then  be  calculated  using  the  following  algorithm: 

1)  Define  J~*  £  9ft3*7  as  °xCi  =  Jmm 0  and  compute  its  columns 
JJ*  =°  zj  x  (0Xc-  — 0  Xj )  where  °z j  is  the  2 -axis  unit  vecto£  - 
for  {j}  and  it  is  understood  that  J**  =  0  for  each  j  >  i. 

2)  Define  nt  =  (0XoP  — °xCi)/  ||°  XoP  -°  ||  and  compute  the 

Jth  element  of  J,  as  Jij  =  nf  JJ*,  j  =  1,2,...,  7. 


This  algorithm  is  computationally  simple,  recursive,  and  robust, 
and  examples  of  its  implementation  are  provided  in  Sections  IE  and 
IV. 

Triangular  and  Rectangular  Openings 

Here  the  critical  point  candidate  can  be  located  using  the  schesae 
(17)  derived  for  circular  openings,  but  the  determination  of  e9t  2nd 
Ji  is  slightly  more  involved  because  these  openings  do  not  possess 
the  symmetry  of  a  circle.  Our  basic  approach  to  determining  these 
quantities  is  to  specify  local  coordinate  systems  at  the  polygon 
vertices,  using  an  off-line  calculation,  and  then  to  represent  °xCi 
in  these  new  frames;  once  these  coordinate  transformations  have 
been  completed,  e5i,  n,  can  be  computed  easily,  and  the  Ji  can  be 
constructed  using  the  algorithm  given  for  circular  openings  [18J.  It 
should  be  mentioned  that  the  scheme  developed  few  triangular  and 
rectangular  openings  actually  works  for  any  convex  polyhedron. 

In  all  cases  where  we  desire  the  manipulator  to  reach  through 
an  opening,  the  collision  avoidance  scheme  outlined  above  provkies 
an  efficient  and  flexible  environment.  Flexibility  can  be  achieved 
by  noting  that  the  scheme  can  be  modified  for  model-based  or 
sensor-based  manipulator  control.  In  model-based  control  where 
information  is  available  regarding  the  environment,  the  collision 
avoidance  scheme  described  above  allows  the  robot  maximal  use 
of  its  workspace,  since  obstacles  are  treated  as  inequality  constraints. 
For  example,  the  collision  avoidance  scheme  for  a  triangular  opening 
Wifizes  knowledge  of  the  location  of  each  of  the  three  vertices  of 
the  triangle,  and  in  this  case  the  manipulator  has  the  benefit  of  a 
large  workspace  at  the  expense  of  requiring  extensive  information 
regarding  the  world.  If  less  information  is  available,  ne  triangular 
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Kg.  5.  (a)  Response  of  cod-effcctor  coordinate  y  in  simulation  example;  (b) 
response  of  end-effector  coordinate  x  in  simulation  example;  (c)  critical  point 
distance  dc.  in  simulation  example 


opening  can  be  modeled  as  a  point  in  die  plane.  Then  the  collision 
avoidance  scheme  for  a  circular  opening  can  be  used  with  a  zero  inner 
radius  anda  small  outer  radius.  This  has  the  effect  of  defining  a  region 
that  die  manipulator  .critical  point  .  should Upassr^uvugL  Ecn  modi 
less  information  is  needed  regarding  the  boundary  of  the  opening, 
but  die  effective  workspace  of  the  robot  is  decreased.  _ 

.  DDL  Simulation  Results  ...  r 

The  robot  collision  avoidance  strategy  described  in  Section  II  i£ 
now  applied  to  the  RRC  manipulator  (7DOF)  mounted  on  a  mobile 
platform  (1DOF)  in  a  computer  simulation  study.  The  simulation 


Fig.  6.  Control  hardware  diagram. 

environment  consists  of  a  Personal  Iris  Graphics  Workstation,  the 
Hydra  graphics  rendering  package  developed  at  JPL.  and  the  software 
necessary  to  realize  the  DLS  configuration  control  implementation  of 
the  collision  avoidance  strategy. 

For  simulation  of  control  algorithms  and  for  previewing  morions 
of  the  manipulator-plus-platfonn  system,  a  graphical  simulation  ca¬ 
pability  has  been  developed  on  the  IRIS  Workstation.  The  Hydra 
graphics  rendering  package  allows  the  user  to:  (1)  btrild  graphical 
models  of  typical  robot  components  and  workspace  structures,  (2) 
input  relative  motions  to  the  components,  and  (3)  change  the  viewing 
perspective  of  the  workspace  of  the  robot  Thus,  the  combmarioa  of 
the  DLS  control  implementation  of  the  collision  avoidance  scheme 
and  the  Hydra  graphics  rendering  capabilities  allow  for  compirte 
robot  system  simulation. 

The  objective  of  the  simulation  example  is  to  illustrate  the  cothaon 
avoidance  capability  of  the  RRC  manipulator  and  mobile  pbtfonn 
for  the  task  of  reaching  inside  the  truss  structure  depicted  graphically 
in  Kg.  4.  This  truss  structure  is  one  of  die  components  in  Ae  RSI 
Laboratory  at  JPL,  and  in  this  example  we  desire  to  aMer  the  mtcooc 
of  the  truss  and  then  move  around  in  the  interior  to  tri  nr1^  an 
internal  inspection  task.  Thus  this  tad:  requires  the  robot  to  tack 
a  desired  position  trajectory  while  simultaneously  avoiding  coKaoo 
with  the  beams  of  the  truss  openings.  « 

In  this  simulation,  the  DLS  configuarion^contpoi  approach  to 
obstacle  avoidance  summarized  in  Section  II-B  is  imptemeatod  with 
the  8  DOF  system.  Here  die  triangular  opening  pass-daough  mefood 
is  employed  to  mode!  the  situation  in  which  reduced  jnforwwinn 
is  available  regarding  die  truss  structure  geometry..  -The  trimpdar 
opening  in  the  truss  has  vertices  °pi  =  [~3j08  —0.86  = 

(-3.08  -0.86  0.55]t,°P3  =  (-3.96  -086  0-55jT  (relative  aatfie 
user-specified  world  frame  and  with  meter  die  unit  of  lengfr)  and 
through  simple  calculation  the  center  point  equidistam  from  the  sides 
of  die  triangle  is  found  to  be  °Xop  =  [-3.29  —  0.86  0.77}^  The 
outer  and  inner  raefii  are  defined  as  r  =  {0.09  0.05]T,  is  ret  as 
etnax  =  0.04m,  and  die  obstacle  surface  normal  is  =  JO  ldjT. 
The  inner  radius  is  relaxed  (from  n 0.0  to  r«-  =  0U05)  to  alow 
the  robot  to  make  larger  use  of  die  workspace  to  complete  the  tide 
once  a  link  is  critical  it  is  allowed  to  deviate  five  centimeters  from  Ae 
“pass-through”  point  before  the  obstacle  avoidance  task  is  acfhafcxL 
Given  die  modeling  assumptions  and  the  buffer  thickness,  la  die 
worst  case  the  upper  arm  of  the  manipulator  would  be  permitted 
fcMpproach  the  truss  structure  to  within  5  centimeters  before  the  tad: 
is  aborted  In  tins  simulation,  from  an  initial  end-effector  pashm 
of  x  =  [—3.29  —  0.80  0.77] r  we  want  the  robot  to  move  - 
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Q50m  in  tbe  ^-direction  and  then  -0.162m  in  the  x -direction  and 
-®iI8m  in  die  z-cfirection;  thus  the  arm  will  reach  0.45  meters 
xiz&de  the  trass  and  then  move  0.20  meters  parallel  to  the  diagonal 
dbnent  of  tk  truss.  The  desired  end-effector  trajectory  is  given  as 
x*(f)  =  [OlO  —  0.50(f/r  —  sin(2nt/T)/2x)  0.0]T  for  t  C  [0,1.25], 
z  —  1.25w  an d  Xd(t)  =  [—0-162 (f/r  —  stn(27rt/r)/2z)  0.0  — 
flLH8(f / T  p-  »n(2af / r) / 2tt)] t  for  t  C  [1.25,2.50],  r  =  1.25.  To 
teat  the  Cartesian  positioning  capability  of  the  ann/platform  system, 
fe  orientation  control  is  turned  off  for  this  simulation.  Thus,  the 
Cartesian  task  dimension  is  defined  as  m  —  3.  When  the  collision 
avoidance  inequality  constraint  is  activated,  r  =  1  and  the  task- 
iface  DOF  are  increased  to  m  -j-  r  =  4.  The  weights  for  the 
DLS  confirmation  control  strategy  (14)  are  chosen  as  K  =  100, 
IT  =  lOQJKV  =  I»  and  wmax  =  1000.  Figs.  5(a)  and  5(b) 
Aaw  the  ewotatxh  trf  the  teyectory  in  die  y  and  z  directions, 
npectivcfy.  and  Bgl 5(c)  shows  the  critical  point  distance  d*-  for? 
fccoQiskrawvoklance  subtask.  From  these  figures;  it  is  seen  that  the  ~ 
end-effector  task  and  the  collision  avoidance  requirement  are  satisfied 
anoltaneomdy  with  very  little  error  in  the  end-effector  trajectory 
tacking.  ^  ^  ;  v 

-  iv.  Experimental  Results 

hi  this  section,  we  describe  the  basic  features  of  the  manipu- 
fator  control  system  in  the  JPL  RSI  Laboratory  and  summarize 
an  experimental  study  using  die  collision  avoidance  scheme.  The 
uripul  ator  system  is  shown  in  Fig.  1  and  a  control  hardware  diagram 
is  shown  in  Fig.  6.  The  experimental  environment  consists  of  an 
RRC  Model  K-1207  (7DOF)  ann/control  unit,  a  VME  chassis  with 
fite  Heurifaow  MC68020  processor  boards  and  additional  interface 
cards,  two  joysticks,  a  motorized  platform/control  unit  on  which  the 
no  is  matatoed,.2nd  a  Silicon  Graphics  IRIS  Workstation.  The  truss 
sfeocture  in  Ibe  Laboratory  has  eight  triangular  openings,  and  in  this 
experiment  me  want  die  robot  to  enter  the  truss  and  track  a  trajectory 
ieide  the  tress  structure. 

As  shown  in  Fig.  2(a),  the  dexterous  manipulator  has  seven  revolute 
joints  in  an  afcrmntriig  roll/pitch  sequence  beginning  with  the  shoulder 
roff  at  the  base  and  ending  with  the  tool-plate  roll  at  the  hand.  The  arm 
pedestal  is  mounted  on  a  motorized  mobile  platform  which  provides 
cue  additional  translational  degree-of-freedom  that  can  be  treated 
s  a  prismatic  joint  The  complete  manipulator  system  therefore 
k ®  eight  independent  joint  degrees-of-freedom;  the  system  has  two 
degrees-of-redundancy  since  six  joints  are  sufficient  for  die  basic 
t&fc  of  end-cffector  position  and  orientation  in  the  three-dimensional 
workspace.  Although  the  real-time  system  is  capable  of  moving  die 
platform,  unlike  in  the  simulation  environment  the  platform  is  not 
tested  as  an  8th  degree-of-freedom  since  die  present  serial  interface 
to  tfceplatfonn  is  loo  slow  for  the  platform  to  be  considered  as  a 
trae  joint.  -  “ 

The  Robotics  Research  arm  is  controlled  by  a  real-time 
Hcroprocessor-based  controller  that  interfaces  directly  with  the 
Moltibus-based  arm  control  unit  supplied  by  the  manufacturer 
(see  Fig.  6X  The  real-time  controller  is  a  VMEbus-based  system 
fa  uses  five  Motorola  MC68020  processors  along  with  various 
daa  acquisition,  memory,  and  communication  devices.  The  VME 
controller  is  finked  via  socket  communication  to  the  Silicon  Graphics 
IRIS  Workstation,  which  serves  as  the  host  computer  for  the  graphical 
wer  interface.  All  of  die  control  software  is  written  in  the  C  language 
and  executes  on  the  VME  controller  running  the  Vx Works  real-time 
operating  system.  The  real-time  VME  chassis  communicates  with  the 
am  control  unit  (Multibus  chassis)  via  a  two-card  VME-to-MuItibils  ~ 
adaptor  from  the  Bit  3  Corporation.  This  enables  joint  set-points 
to  be  sent  to  the  arm  control  unit  at  a  high  speed  (400  Hz)  via 
a  shared  memory  servo-level  interface.  Thus,  the  arm  control  unit 
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Fig.  7.  (a)  Response  of  end-effector  coordinate  y  in  experiment  one;  (b) 
response  of  end-effector  coordinate  x  in  experiment  one;  (c)  critical  point 
distance  dCi  in  experiment  one. 


is  a  low-level  joint-space  position  controller  with  the  higher  level 
control  being  done  on  the  VME  chassis.  The  high-level  control 
loop,  which  consists  of  a  trajectory  generator,  forward  kinematics, 
and  DLS  configuration  control  algorithm,  runs  on  a  single  Heurikon 
68020  board  at  44  Hertz  (22.7  milliseconds).  The  portion  of  this  time 
used  to  calculate  the  collision  avoidance  task  is  35  milliseconds. 
The  other  four  processors  are  used  for  servicing  hardware  devices. 
These  hardware  devices  are  an  Ethernet  board  (for  user  input), 
an  analog  input  board  (for  the  joysticks),  a  serial  board  (for  the 
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Fig.  8.  (a)  Response  of  end-effector  coordinate  y  in  experiment  two;  (b)  response  of  end^ffector  cootdmte  x  in  experiment  two;  (c)  remoose  of  endefixmr 
coordinate  z  in  experiment  two;  critical  point  distance  dCi  in  experiment  two. 


mobile  platform),  and  a  Bit3  board  (for  communication  with  the 
RRC  controller).  '  ;  -*  : 

The  DLS  configuration  control  implementation  of  obstacle  avoid¬ 
ance  described  in  Section  II  was  slightly  modified  in  these  experi¬ 
ments.  To  simulate  die  8th  degrec-of-freedom,  the  obstacle  avoidance 
code  monitored  the  horizontal  ^-component  of  the  critical  distance; 
this  is  the  component  parallel  to  die  platform  tfirection  of  motion.  If 
the  x-compcraentoftbe  critical  distance  exceeds  0.10m,  a  command 
is  issued  to  move  die  platform  an  amount  equal  to  die  x-component 
of  the  critical  distance,  awhile  simultaneously  commanding  the  end- 
effector  to  compensatory  die  same  amount  in  die  opposite  direction. 
The  net  effect  is' to  leave  dw'eadH^fortor^ position  die  same,  wiiile 
attempting  to  lessen  the  horizontal  component  of  the  critical  distance. 
This  initial  implementation  provides  -a  -simple  heuristic  model  for 
simulating  the  8  degrees-of-fieedom  with  the  current  experimental 
setup.  In  die  experimental  study,  many  different  tasks  were  studied. 
Here  two  examples  are  chosen  for  presentation  to  illustrate  the  flexi¬ 
bility  of  the  collision  avoidance  scheme  and  die  DLS  implementation 
of  configuration  controLTn  each  example,  the  collision  avoidance 
strategy  is  implemented  using  the  triangular  opening  pass-thmugh 
model  of  the  obstacle  and  the  obstacle  geometry  and  die  initial 
configuration  of  the  arm  and  platform  are  exactly  the  same  as  in 
the  simulation  study.  : 

In  the  first  experiment,  from  an  initial  end-effector  position  of 
^  =  [—3.29  *—  0.80  0.77] T  the  end-effector  is  commanded  to  mov<?  ‘ 
-0.50m  in  the  y-direction  and  then  —0.162m  in  the  x-direction 
and  —0.118m  in  the  z -direction,  and  orientation  control  of  the 


end-effector  is  not  activated.  This  motion  will  cause  the  n  to 
reach  0.45  meters  inside  the  truss  and  then  move  parallel  to  fc 
diagonal  of  the  triangular  opening.  Thus,  the  trajectory  is  spotted 
to  produce  die  same  motion  as  in  die  simulation  example  only  at 
a  slower  rate.  The  desired  end-effector  trajectory  is  specified  as 
M*)  =  [0.0  -  0.50 (f/r  -  8in(2Kt/r)/2x)  0.0jr  for  t  C  flLSJIi, 
t  =  5.0,  and  x*(f)  =  [— 0.162(*/t  —  «»(2xt/r)/2x)OL0  — 
0.118(t/r  -  «n(2xf/r)/2x)]r  for  t  C  [5.0,10.0],  x  =  5.0.  The 
wdghts  for  the  DLS  configuration  control  strategy  (14)^5)  me 
chosar  as?if  =^0.  W  «  LO,  J^=rf UW5^  and  £ 

^  W  show  the  evolution  of  the  trajectory"  Inttey 
and  x-directicmi  relatively,  and  7(c)  shows  the  critia&pattt 
distance  dCi  for  the  collision  avoidance  subtask. These  figures  show 
that  trajectory  tracking  is  reasonable,  with  the  end-effector  tiaJjag 
die  desired  trajectory  to  within  0.01m  in  die  x  and  y-tfirections.sd 
0.02m  in  the  x-direction.  The  obstacle  avoidance  algoridm  mi'urii 
the  critical  distance  within  a  band  of  0.012m  (the  critical  is 

zero  if  the  critical  point  is  within  the  inner  radios  of  O  Q5m)  _ 

In  the  second  experiment,  an  inspection-type  task  is  created.  The 
first  two  moves  will  preposition  the  arm  for  the  intemal  SnyyAif  Q| 
the  first  move,  the  arm  enters  the  truss  under  position  *nrl  odmaui 
control.  For  the  second  move,  the  wrist  joint  is  pitched  down  Hi 
the  xt/ -plane  of  the  tool  frame  is  parallel  to  the  xy-plane  of  the  world 
frame.  The  internal  inspection  task  comprises  the  next  four  moves, 
frore  the  end-effector  task  is  to  trace  out  a  square  under  posrtiooaad 
orientation  control,  thereby  simulating  the  inspection  of  a  flat  qrfetry 
inside  the  truss.  The  initial  end-effector  position  is  identical  to  tte 
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firc*  experiment.  The  first  move  inserts  the  arm  into  the  truss  and  is 
(feiaed  by  xj(t)  =  [0.0  —  0.4 o(t/r  —  sin(2nt/r)/2ir)  0.0]T 
for  t  C  [0,10.0],  r  =  10.0.  The  second  trajectory  pitches 
foe  wrist  down  while  moving  the  y  and  ^-coordinates,  accord¬ 
ing  to  x<j(f)  =  [0.0  0.15(f/r  —  sin(2*t/r)/2n)  0.15 (t/r  — 
si*{2zt/T)/2jr)]T  for  t  C  [10.0,20.0],  t  =  10.0.  The  inspection 
mate  trajectories  trace  out  a  square:  xj(t)  —  [0.0  —  0.10(t/r  - 
««(23rt/r)/2ar)  0.0]r  fort  C  [20.0,25.0],  t  =  5.0 ,  xj(t)  = 
{— ftl0(t/r  -  sin(2xt/T)/2ir)  0.0  0.0]T  for  t  C  [25.0,30.0], 
t  =  5.0,  Xd(t)  =  [0.0  0.10(t/T  -  sui(2trt/r)/2tr)  0.0]r 
far  t  C  [30.0,35.0],  r  =  5.0,  and  xa(f)  =  [0.10(t/r  - 
*M(2jrt/r)/2jr)  0.0  0.0]r  for  t  C  [35.0,40.0],  r  =  5.0.  fig. 
8(ft  8(b),  and  8(c)  show  the  evolution  of  the  cnd-effector  trajectory 
in  foe  y,  x,  and  ^-directions,  respectively,  and  Fig.  8(d)  shows  the 
critical  point  distance  for  the  collision  avoidance  subtask.  Again 
the  figures  show  that  the  obstacle  avoidance  task  is  achieved  by 
keeping  die  critical  distance  within  0.014m.  End-effector  trajectory 
tnrfmg  is  good  for  the  majority  of  the  trajectory  except  when 
f  =  15  secs  and  again  at  t  =  32  secs,  where  tracking  errors  in  x 
is  observed.  At  these  times,  the  critical  distance  plot  shows  that  the 
DLS  algorithm  optimally  sacrifices  end-effector  trajectory  tracking 
to  ensure  satisfaction  of  the  collision  avoidance  constraint 

V.  Conclusions 

This  paper  presents  an  efficient,  flexible,  and  robust  scheme  for 
ieal-time  collision  avoidance  for  redundant  manipulators  at  the  joint- 
control  leveL  The  control  scheme  is  derived  at  die  inverse  kinematics 
level  using  a  DLS  implementation  of  configuration  control,  and  is 
intended  for  implementation  with  a  high-level  path  planning  system 
or  with  a  tdcopeiation  system.  The  collision  avoidance  strategy  can 
be  modified  for  use  with  a  variety  of  obstacle  geometries,  including 
soGd  obstacles  and  openings.  The  configuration  control  framework 
rise  allows  the  user  to  trade-off  end-effector  tracking  with  collision 
avoidance  in  cases  when  both  tasks  cannot  be  achieved  accurately. 
Additionally,  the  scheme  provides  the  user  with  the  capability  to 
vary  die  level  of  information  needed  by  the  algorithm,  so  that  the 
approach  is  equally  applicable  to  model-based  control  or  sensor-based 
coatroL  The  algorithm  presented  here  has  been  tested  extensively  in 
amnlations  as  well  as  in  experiments,  and  the  results  of  these  studies 
confirm  the  efficacy  of  die  method. 
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permit  (Al)  to  be  rewritten  as  ,  - 

minimize^2  ||  4>  ||2  +  il  uS  ||2  +  ||  f  -2ufj*«}  (A2) 


which  admits  the  solution 

j> = (j*r r + A2jf 1  rTi&  ;  (A3> 

=  JmT(J*rT  +  A2/)_1ri3  (A4) 

where  the  equivalence  of  (A3)  and  (A4)  was  first  noted  in  [14].  The 
substitution  of  the  relations  —  Wx(id  +  = 

WxJWq1  into  (A4)  then  yields  (7). 

A 3,  We  now  derive  the  bounds  (11),(12)  for  the  DLS  schrmrs 
(9),(10).  We  proceed  by  examining  die  convergence  properties  of 
the  DLS  scheme  (9),  rewritten  here  m  a  slightly  simplified  form  fiar 
convenience  of  analysis:  \  ‘  -  ^ 

-  -8  =  [JTJ  +  X2rflJT(id  +  ke)  -  (AS) 

The  error  dynamic  equation  that  results  from  applying  foe  control 
input  6d  calculated  using  (A5)  to  die  system  (5)  is 

e  =  -kJ{JTJ  +  A2/)- 1  JTe  +  (I  -  J(JTJ  +  A2Jj_IJT)ii 

(A 6) 

where  it  is  assumed  that  a  good  position  controller  is  avribfcfe  so  that 
9  =  0d.  The  bound  (1 1)  is  derived  by  using  at  Lyapunov  analysis, 
and  toward  this  end  we  consider  the  following  Lyapunov  function 
candidate 


V  =  eT  e/2. 


Differentiating  V  along  die  error  dynamics  (A 6)  yields 
V  =  —keT[J(JTJ  +  A2/)-1  JT]e  +  eT[I  -  J(JT  J + X*I)~l 

s-tCTl",,+OTll“"e|1 


where  <Tmin  is  the  minimum  singular  value  of  J.  The  upper  bound  m 
V  given  in  (A7)  implies  that  e  is  bounded,  and  permits  foe  foflow^g 
upper-bound  on  []  e  ||  to  be  established: 


l|e||< 
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Appendix 

Some  Properties  of  the  DLS  Configuration  Control  Scheme 

In  this  Appendix,  some  properties  of  die  DLS  configuration  control 
approach  to  kinematic  control  are  briefly  summarized. 

XI  We  first  derive  the  equivalent  formulation  (7)  of  the  DLS 
scheme  (6),  and  begin  by  noting  that  (6)  is  the  solution  to  the 
following  optimization  problem: 

minimize  {A2#J WvBd  +  (id  4-  Ke  —  J$d)  IV  (id  +  ATe  —  JQd)\ 

(Air 

Observe  that  the  factorization  W  —  Wj  IF,  and  Wv  =  Wj  IV# 
arc  possible  since  Wv  =  Wj  >  0  and  W  =  WT  >  0.  Then  die 
definitions  £  —  VF^.uJ  =  Wx(id  +  A'e),  and  J*  =  WXJW^1 


To  obtain  the  bound  (12),  we  note  from  (A5)  that 

MB  =11  (JT-J  +  (i« + fce) l -  .%. 

where  J  —  UHVT  with  each  ut,v,  defined  by  U  — 
IuiU2***u1^r]  and  V  =#  (viv2--vwJ.  From  (Aft,  it  cm 
be  seen  that  foe  maximum  ||  6  ||  will  occur  when  (id  +  be)  afigus 
with  one  of  the  u,  and  <n  =  A,  which  leads  to 

B^ll<^l|i-  +  fcel- 
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A  New  Adaptive  Control  Algorithm  : 
for  Robot  Manipulators  in  Task  Space 

Gang  Feng,  Member  IEEE 

Abstract —  Adaptive  control  of  robotic  manipulators  in  ta&  space 
coordinates  is  considered  in  this  paper.  A  new  composite  adaptive  con¬ 
trol  law,  which  uses  the  prediction  error  and  tracking  error  to  drive 
parameter  estimation,  is  developed  based  on  sliding  mode  and  a  general 
Lyapunov-like  concept.  It  is  shown  that  global  stability  and  convergence 
can  be  achieved  for  the  adaptive  control  algorithm.  The  algorithm  has 
the  advantage  that  inverse  of  Jacobian  matrix  and  the  bounded  reverse 
of  the  estimated  inertia  matrix  are  not  required.  The  algorithm  is 
further  modified  so  as  to  achieve  robustness  to  bounded  disturbances. 
A  simulation  example  is  provided  to  demonstrate  the  performance  of  the 
proposed  algorithm. 

L  Introduction  *' 

Adaptive  control  of  rigid  robot  manipulators  has  been  the  imaests 
of  many  researchers  for  several  years.  A  recent  survey  [1]  indicates 
the  breadth  of  the  approaches  taken.  In  joint  space,  one  of  the  early 
applications  of  adaptive  control  to  the  manipulator  problem  came 
from  Horowitz  and  Tomizuka  [2]  in  1980.  Their  scheme  assumed 
the  plant  inertia  to  be  time-invariant  and  assumed  the  gravrtyfinction 
torques  to  be  known.  Later  many  other  researchers  have  considered 
controlling  a  linear  perturbational/variatiooal  model,  and  cootioEmg 
the  full  nonlinear  manipulator  with  adaptive  “high  gam,”  see  PJ-J8]. 
Recently  several  globally  convergent  adaptive  control  resides  for 
manipulators  appeared  [9] — [13],  which  were  summarized  a  [14]. 
One  of  the  results  is  from  Craig  et  al.  [9],  which  relies  ob  the 
two  restrictive  assumptions.  The  first  is  that  the  joint  acceleration  is 
measurable  and  the  second  is  that  the  inverse  of  the  estimated  inertia 
matrix  remains  bounded.  The  need  to  measure  joint  acceleration  can 
be  overcome  by  the  introduction  of  a  first  order  filter  as  in  (Kg.  The 
second  assumption  is  more  restrictive  and  requires  modification  of 
the  parameter  update  law,  and  it  can  be  overcome  with  an  algorithm 
introduced  by  Spoog  and  Ortega  [15].  Their  idea  is  to  nse  foe 
fixed  estimate  instead  of  varying  estimate  for  inertia  matrix  and 
therefore  overcome  the  drawback  of  requirement  of  boundedness  of 
the  estimated  inertia  matrix.  Stotine  and  U  [II]  proposed  an  adaptive 
control  algorithm  which  requires  neither  die  measurements  of  the  joint 
accelerations  nor  die  bounded  inverse  of  die  estimated  inertia  retfox. 

A  composite  adaptive  control  law  in  joint  space  was  proponed  in 
[16],  which  used  die  prediction  error  and  die  joint  space  farting 
error  to  drive  die  parameter  estimator.  This  design  is  based  oafe 
observation  that  the  parameter  uncertainty  is  reflected  in  bed  die 
tradting  error  and  the  prediction  error.  Therefore  it  is  riestnHraad 
reasonable  to  extract  the  parameter  information  from  both  of  smbccs. 
In  addition,  Reed  and  loannoa  [17]  have  discussed  die  robefas 
issues  of  adaptive  control  algorithms  for  robotic  manipulators.  They 
applied  ^-modification  robust  adaptive  control  strategy  4o  adhere 
this  objective. 

However  in  practice,  robot  motion  is  basically  defined  by  die 
motion  of  its  end  effector,  Le.  in  robot  control,  the  major  concern 
is  that  the  end-effector  motion  tracks  its  desired  motion,  defined  m 
the  task  space  or  the  so-called  Cartesian  space. 
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ADAPTIVE  STABILIZATION  OF  MECHANICAL  SYSTEMS 
USING  ONLY  CONFIGURATION  MEASUREMENTS 

R.  Colbaugh  K.  Glass 

Department  of  Mechanical  Engineering 
New  Mexico  State  University,  Las  Cruces,  NM  88003  USA 

Abstract 

This  paper  presents  a  new  approach  to  stabilizing  uncertain  mechanical  systems  using 
only  measurements  of  the  system  configuration.  The  proposed  controller  is  computation¬ 
ally  simple,  does  not  require  knowledge  of  the  system  dynamic  model,  and  can  be  imple¬ 
mented  with  a  wide  variety  of  systems.  It  is  shown  that  the  control  strategy  guarantees 
semiglobal  uniform  boundedness  of  all  signals  and  convergence  of  the  system  configura¬ 
tion  error  to  zero.  If  bounded  external  disturbances  are  present,  a  slight  modification  to 
the  control  scheme  ensures  that  uniform  boundedness  of  all  signals  is  retained  and  that 
arbitrarily  accurate  stabilization  of  the  configuration  error  is  achieved.  The  capabilities  of 
the  proposed  controller  are  illustrated  though  both  computer  simulations  and  experiments 
involving  applications  of  importance  in  manufacturing.  These  studies  demonstrate  that 
the  controller  provides  a  simple  and  effective  means  of  obtaining  high  perform  a nee  error 
regulation;  additionally,  the  case  studies  indicate  that  accurate  trajectory  tracking  can  also 
be  realized  with  the  proposed  scheme. 

1.  Introduction 

The  promise  of  improved  productivity,  enhanced  reliability,  reduced  costs,  and  in¬ 
creased  human  safety  has  motivated  the  introduction  of  computer-controlled  mechanical 
systems  in  the  manufacturing  sector,  in  space,  and  in  applications  involving  hazardous 
environments.  This  implementation  of  mechanical  systems  has  revealed  that,  in  many  ap¬ 
plications,  achieving  the  anticipated  benefits  requires  the  development  of  control  systems 
capable  of  providing  good  performance  in  the  presence  of  incomplete  information  regarding 
the  system  and  its  environment.  Additionally,  this  work  has  indicated  the  importance  of 
the  fundamental  task  of  system  stabilization,  that  is,  of  devising  control  laws  which  ensure 
that  any  prescribed  system  state  becomes  an  asymptotically  stable  equilibrium  for  the 
controlled  system. 

This  paper  considers  the  problem  of  stabilizing  mechanical  systems  which  have  at  least 
one  actuator  for  each  configuration  degree-of-treedom.  These  systems  are  typically  referred 
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to  as  fully  actuated  mechanical  systems,  and  are  defined  more  precisely  in  the  following 
section.  For  now  it  is  simply  noted  that  the  class  is  quite  large  and  includes  virtually  all 
systems  of  importance  in  manufacturing,  such  as  robotic  manipulators  and  machine  tools. 
Most  research  on  the  stabilization  of  mechanical  systems  has  focused  on  those  systems  for 
which  complete  model  information  and  state  measurements  are  available;  work  in  this  area 
includes  investigations  of  general  classes  of  mechanical  systems  [e.g.,  1-6]  as  well  as  studies 
focusing  on  particular  systems,  such  as  robotic  manipulators  [7-9]  and  spacecraft  [10,11]. 
As  a  result  of  these  studies,  it  is  now  well-known  that  simple  proportional-derivative  (PD) 
feedback  controllers  are  capable  of  globally  stabilizing  fully  actuated  mechanical  systems, 
provided  that  the  effects  of  any  system  potential  energy  (such  as  gravity)  are  compensated. 
Ordinarily  the  requisite  potential  energy  compensation  is  achieved  by  including  a  model  of 
the  (gradient  of  the)  potential  energy  directly  in  the  control  scheme,  so  that  the  resulting 
controller  possesses  a  PD-plus-potential-model  structure. 

While  the  PD-plus-potential-model  stabilization  strategy  is  simple,  elegant,  and  intu¬ 
itively  appealing,  there  are  two  potential  difficulties  associated  with  this  approach.  First, 
these  controllers  include  a  linear  state  feedback  component  and  therefore  require  full  state 
measurement,  which  can  be  problematic  in  practice.  For  example,  although  robotic  ma¬ 
nipulators  and  machine  tools  are  generally  equipped  with  high-precision  sensors  capable  of 
accurately  measuring  system  position,  determining  the  system  velocity  is  more  challenging 
and  typically  the  velocity  measurements  are  contaminated  with  noise  [12].  In  fact,  veloc¬ 
ity  sensors  are  frequently  omitted  altogether  in  the  interest  of  saving  money,  volume,  and 
weight  in  construction  [12].  The  second  drawback  with  PD-plus-potential-model  stabiliza¬ 
tion  schemes  is  the  need  to  include  the  system  potential  energy  model  in  the  control  law. 
Observe  that  this  approach  to  compensating  the  effects  of  the  system  potential  energy  can 
be  undesirable  because  its  implementation  requires  precise  knowledge  of  the  structure  and 
parameter  values  of  the  potential  energy  model.  This  is  particularly  restrictive  for  man¬ 
ufacturing  automation  applications,  because  in  typical  tasks  the  manufacturing  system 
encounters  many  different  tools,  payloads,  and  workpieces,  and  it  is  unrealistic  to  assume 
that  the  properties  of  all  of  these  are  accurately  known.  In  addition,  this  requirement 
limits  the  modularity  and  portability  of  the  controllers,  so  that  implementation  of  these 
schemes  with  a  new  system  can  involve  considerable  effort. 

Recognizing  the  aforementioned  difficulties,  several  researchers  have  studied  methods 
of  addressing  one  or  the  other  problem.  For  instance,  recently  several  schemes  have  been 
proposed  for  controlling  robotic  manipulators  using  only  position  measurements  [13-17]. 
One  approach  to  this  problem  is  to  design  an  observer  for  estimating  the  velocity  utilizing 
the  available  (high-quality)  position  inforriiatiiSh,  and  then  to  use  this  estimated  velocity 
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in  place  of  the  actual  velocity  in  the  state  feedback  component  of  the  control  law;  this 
strategy  is  used  in  [13-15]  to  solve  the  manipulator  tracking  problem.  Alternatively,  the 
simple  structure  of  the  PD-plus-potential-model  solution  to  the  mechanical  system  stabi¬ 
lization  problem  suggests  a  correspondingly  simple  solution  to  the  stabilization  problem 
when  only  position  measurements  are  available.  Thus  elegant  control  laws  composed  of  a 
potential  energy  compensation  term  and  a  linear  dynamic  (first  order)  compensator  have 
been  proposed  very  recently  in  [16,17].  It  is  noted  that  while  these  schemes  eliminate  the 
need  for  velocity  measurements,  they  still  require  knowledge  of  the  potential  energy  model 
for  the  robotic  manipulator  as  well  as  any  payload  to  provide  accurate  position  control. 

Research  focusing  on  reducing  the  information  required  concerning  the  mechanical 
system  potential  energy  has  tended  to  concentrate  on  adaptive  control  approaches.  The 
schemes  presented  in  [18,19]  possess  a  PD-plus-potential-model  form  in  which  only  the 
structure  of  the  potential  energy  model  must  be  known;  the  (constant)  parameters  ap¬ 
pearing  in  the  model  are  assumed  to  be  unknown,  and  this  uncertainty  is  compensated 
adaptively.  The  information  required  concerning  the  potential  energy  is  reduced  further  in 
the  controller  proposed  in  [20],  where  a  scheme  is  given  that  adaptively  compensates  for 
uncertainty  regarding  both  the  structure  and  the  parameter  values  of  the  system  potential 
energy.  It  is  mentioned  that  the  adaptive  controllers  presented  in  [18-20]  all  possess  a  PD 
feedback  component,  and  consequently  all  require  velocity  measurement.  An  additional 
observation  concerning  this  work  is  that  none  of  the  stabilization  schemes  [16-20]  have 
been  verified  through  a  comprehensive  simulation  and  experimental  study. 

This  paper  introduces  a  new  adaptive  stabilization  scheme  for  fully  actuated  mechan¬ 
ical  systems  that  eliminates  the  need  for  any  knowledge  of  the  system  dynamic  model  and 
that  can  be  implemented  using  only  measurements  of  the  system  configuration  -  It  is  shown 
that  the  control  strategy  guarantees  semiglobal  uniform  boundedness  of  all  signals  and  con¬ 
vergence  of  the  system  configuration  error  to  zero.  If  bounded  external  disturbances  are 
present,  a  slight  modification  to  the  control  scheme  ensures  that  uniform  boundedness  of 
all  signals  is  retained  and  that  arbitrarily  accurate  stabilization  of  the  configuration  error 
is  achieved.  The  efficacy  of  the  proposed  controller  is  illustrated  through  both  computer 
simulations  and  experiments  involving  three  important  manufacturing  tasks:  cutting  force 
control  in  machining,  control  of  contouring  with  a  machine  tool,  and  position  control  of 
an  industrial  robot. 

The  paper  is  organized  as  follows.  In  Section  2  some  preliminary  facts  of  relevance  to 
the  mechanical  system  stabilization  problem  are  established.  The  adaptive  stabilization 
scheme  is  presented  and  analyzed  in  Section  3.  The  performance  of  the  controller  is 
illustrated  in  Section  4  through  a  computer  ISilnulation  study  and  in  Section  5  through 
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an  experimental  investigation.  Finally,  Section  6  summarizes  the  paper  and  draws  some 
conclusions. 

2.  Preliminaries 

Consider  the  class  of  mechanical  systems  that  possess  n  configuration  degrees-of- 
freedom  (DOF)  and  m  >  n  actuators  (of  which  n  are  independent).  It  can  be  shown 
that  this  class  of  systems  can  be  modeled  quantitatively  as  follows  [21]: 

M(x)T  =  if(x)x  +  Fcc(x,x)x  +  G(x)  +  d(x,x,f)  (1) 

where  x  €  3fre  is  the  vector  of  system  generalized  coordinates,  T  G  &m  is  the  vector  of  ac¬ 
tuator  inputs,  M  G  9£nXm  is  a  bounded  surjective  matrix,  H  G  3£nXn  is  the  system  inertia 
matrix,  Vcc  G  3?nXn  quantifies  Coriolis  and  centripetal  acceleration  effects,  and  G  G 
arises  from  the  system  potential  energy.  The  term  d  G  3?n  is  an  unknown  vector,  assumed 
to  be  bounded  with  bounded  first  derivative  with  respect  to  time,  that  can  represent  un¬ 
modelled  state- dependent  effects  or  time-dependent  external  disturbances;  this  term  is 
included  in  the  system  dynamics  to  permit  the  robustness  of  the  proposed  stabilization 
scheme  to  be  studied.  It  is  well-known  that  the  dynamics  (1)  possesses  considerable  struc¬ 
ture  [e.g.,  21].  For  example,  it  is  easy  to  show  that,  for  any  set  of  generalized  coordinates 
x,  the  matrix  H  is  symmetric  and  positive-definite,  the  matrix  Vcc  depends  linearly  on  x, 
and  the  matrices  H  and  Vcc  are  related  according  to  E  —  Vcc  4-  V£  [21].  Additionally, 
we  will  assume  in  what  follows  that  the  inertia  matrix  H  and  potential  energy  gradient 
G  are  bounded  with  bounded  first  partial  derivatives;  this  latter  property  holds  for  those 
mechanical  systems  of  interest  in  manufacturing  applications,  such  as  robotic  manipulators 
and  machine  tools. 

Observe  from  (1)  that  the  vector  of  actuator  inputs  T  does  not  affect  the  system 
dynamics  directly,  and  instead  influences  the  system  evolution  through  the  column-space 
of  M.  By  definition  the  columns  of  M  span  3£n,  so  that  (1)  is  fully  actuated  and  the 
situation  is  straightforward.  However,  it  is  very  useful  to  explicitly  identify  the  generalized 
force  vector  F  G  associated  with  the  generalized  coordinate  vector  x: 

F  =  M(x)T  (2) 

so  that  (1)  becomes: 

F  =  H(x)x  +  Vcc(x,  x)x  +  G(x)  +  d(x,  x,  t )  (3) 

If  m  >  n  then  there  exists  an  infinite  number  of  actuator  inputs  corresponding  to  a  given 
generalized  force  F,  which  suggests  that  Tcan^e  chosen  to  enhance  the  performance  of  the 
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system  in  some  way.  This  observation  and  the  partitioned  representation  of  the  system 
(2), (3)  motivates  the  development  of  a  control  system  that  consists  of  two  subsystems. 
The  first  subsystem  is  an  adaptive  scheme  that  generates  the  control  input  F  required 
to  stabilize  the  system  (3),  and  the  second  subsystem  is  an  algorithm  for  specifying  the 
actuator  input  T  that  provides  the  desired  F  and  simultaneously  utilizes  any  available 
actuator  redundancy  to  improve  performance.  Partitioning  the  control  problem  in  this 
way  is  very  natural  and  can  be  quite  effective  when  applied  to  specific  physical  systems 
[22-24].  Given  (2),  an  actuator  input  T  corresponding  to  the  desired  control  input  F  can 
always  be  determined;  this  problem  is  discussed  extensively  in  [22-24]  and  the  references 
therein.  Therefore  for  the  remainder  of  the  paper  it  is  assumed  that  the  control  input  F 
can  be  commanded  directly,  and  the  focus  of  the  subsequent  discussion  is  on  the  problem 
of  stabilizing  the  system  (3)  in  the  presence  of  incomplete  information  regarding  this 
mechanical  system.  More  specifically,  the  control  objective  is  to  specify  the  control  input 
F  to  the  system  (3),  using  only  measurements  of  x  and  without  any  a  ‘priori  information 
concerning  the  system  dynamic  model,  in  such  a  way  that  the  system  (3)  evolves  from  its 
initial  state  to  the  desired  final  state  x  =  x^,  x  =  0. 

Our  approach  to  the  design  and  analysis  of  a  suitable  controller  is  based  on  Lyapunov 
stability  theory.  The  following  two  lemmas  summarize  certain  facts  regarding  a  class  of 
Lyapunov  functions  and  will  be  of  direct  relevance  in  our  development.  The  first  lemma 
establishes  a  useful  result  concerning  convergence  of  a  dynamical  system  to  a  neighborhood 
of  the  origin. 

Lemma  1:  Consider  the  coupled  dynamical  system  Xi  =  fi(xi,X2,<),  X2  =  f2(xi,X2,t). 
Let  V (xj, X2,  t)  be  a  Lyapunov  function  for  the  system  with  the  properties 

Ai  ||  xi  ||2  +A2  ||  x2  ||2<  V  <  As  ||  xj  ||2  +A,  ||  x2  ||2 
V  <  —As  ||  x2  ||2  -Ae  ||  x2  ||2  +s 

where  e  and  the  A  £  are  positive  scalar  constants.  Define  6  =  max(A3/A5,  A4/A6)  and 
rf  =  (Se/Xij1/2  for  i  =  1,2.  Then  for  any  initial  state  Xi(0),X2(0)  the  system  will  evolve  so 
that  xi  (<),  X2  (f )  are  uniformly  bounded  and  converge  exponentially  to  the  closed  balls  BTl, 
Br2 ,  respectively,  where  Bri  =  {x,-  :  ||  x,  ||<  r;}  (see  [25]  for  a  discussion  of  exponential 
convergence  to  a  closed  ball). 

Proof:  The  proof  is  based  on  an  extension  of  the  global  exponential  convergence  theorem 
of  Corless  [25]  and  is  given  in  [26].  □ 

The  next  result  is  a  special  case  of  LaSalle’s  invariant  set  theorem. 

2  - 125 

Lemma  2:  Consider  the  coupled  dynamical  system  Xj  =  fi(xi,  X2),  X2  =  f2(x1,X2)  with 
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equilibrium  at  the  origin.  Let  V(xi,X2)  be  a  Lyapunov  function  for  the  system  with  the 
properties 

A,  ||  Xl  ||2  +A2  ||  x2  ||2<  V  <  X3  ||  x,  ||2  +A4  ||  x2  ||2 
V  <  -As  ||  xa  f 

where  the  A,  are  positive  scalar  constants.  Then  the  origin  is  a  stable  equilibrium,  and 
Xj  — *  0  as  t  — ►  oo. 

Proof:  The  proof  can  be  found  in  [e.g.,  27].  □ 

3.  Adaptive  Stabilization  Scheme 

We  now  turn  to  the  development  of  the  proposed  approach  to  stabilizing  uncertain 
mechanical  systems  using  only  configuration  measurements.  Before  presenting  our  main 
results,  we  briefly  consider  the  nonadaptive  case  to  fix  some  ideas  and  introduce  a  useful 
Lyapunov  function. 

3.1  Nonadaptive  Case 

Consider  the  problem  of  stabilizing  the  mechanical  system  (3)  using  only  measurements 
of  system  configuration  but  with  complete  knowledge  of  the  system  dynamic  model.  In 
[16],  Berghuis  and  Nijmeijer  proposed  the  following  elegant  solution  to  this  problem  for 
robotic  manipulators  based  on  the  seminal  work  of  Takegaki  and  Arimoto  [7]: 

F  =  G(x)  +  &i  72w  +  k2  j2e  (4) 

w  =  — 2yw  +  72e 

where  e  =  x,*  —  x  is  the  position  regulation  error  (recall  that  x^  is  constant),  k1,k2,'y  are 
positive  scalar  constants,  and  where  we  have  reformulated  the  scheme  given  in  [16]  some¬ 
what  to  make  our  subsequent  analysis  more  convenient.  Note  that  (4)  is  implementable 
without  velocity  information  because,  although  w  depends  on  e,  the  control  law  requires 
only  w  and  this  term  depends  only  on  e.  It  is  shown  in  [16]  that  (4)  globally  stabilizes 
the  dynamics  (3)  at  the  equilibrium  e  =  e  =  w  =  0;  however,  the  stability  analysis  in  [16] 
employs  a  Lyapunov  function  with  a  negative-semidefinite  time  derivative,  so  that  it  is  not 
clear  how  to  extend  this  analysis  to  the  adaptive  case.  In  what  follows,  we  present  an  al¬ 
ternative  stability  analysis  which  demonstrates  that  the  equilibrium  point  is  exponentially 
stable,  and  which  forms  the  basis  for  our  derivation  of  an  adaptive  solution  to  the  problem. 

Lemma  3:  The  controller  (4)  exponentially  stabilizes  the  nominal  mechanical  system 

dynamics  (3)  (with  d  =  0)  at  the  equilibrium  e  =  e  =  w  =  0  provided  7  is  chosm 
_  .  ,  .  2-126 
sufficiently  large. 
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Proof:  Applying  the  controller  (4)  to  the  nominal  mechanical  system  dynamics  (3)  yields 
the  closed-loop  system 

He  +  Vcce  +  kij2w  +  k2/y2e  =  0  (5) 

w  =  — 27W  +  72e 

Consider  the  Lyapunov  function  candidate 

Vi  =  \eTHe  +  \k2  j2eTe  +  +  7^- eTHe  -  -wTHe  (6) 

2  2  2  fci7  7  v  ' 

and  note  that  V\  in  (6)  is  positive-definite  and  radi ally-unbounded  provided  7  is  chosen 
large  enough.  Differentiating  (6)  along  (5)  and  simplifying  yields 

Vi  =  -(7  -  -^-)eTHe  -  ||  e  ||2  -fcl7  ||  w  ||2  +2w THe 

m7  M 

+  ~eT[j^Vcce  -  Vccw] 

7  kx 

<  -(7  -  ^-)A ™.(F)  ||  e  ||2  -Si  ||  e  f  -ilT  ||  w  ||2 

fci7  ki 

+  2\maI(H)  ||  w  HU  e  ||  +^££  ||  e  ||||  e  f  ||  w  ||||  e  ||2  (7) 

where  Amjn(-),  Xmax(‘)  denote  the  minimum  and  maximum  eigenvalue  of  the  matrix  ar¬ 
gument,  respectively,  and  kcc  is  a  scalar  upper  bound  on  the  linear  dependency  of  Vcc 
on  x  (i.e.  ,  II  Vcc  Ilf  <  kcc  ||  x  ||  Vx  with  ||  •  \\F  the  Frobenius  matrix  norm).  Define 

z  =  III  e  II  lie  ||  ||  w  ||]T  and 

0  0 

(l )  -XmaX(H) 

^ rriax  (H)  h  7  J 

and  note  that  Qi  is  positive-definite  if  7  is  chosen  sufficiently  large.  The  following  bound 
on  Vi  in  (7)  can  then  be  established: 

v,  <  -zTC.z  +  II  e  llll  e  f  +' Y  II  w  INI  e  ||2 
<-(Amj„(0.)-^1/2)  II*  H2  (8) 

for  some  scalar  constant  ai  which  does  not  increase  as  7  is  increased.  Let  ci(t)  = 
Xmin(Q  1)  —  V\ {t)H  and  choose  7  large  enough  so  that  ci(0)  >  0  (this  is  always 


Qi  =  0 

_  0 
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possible).  Then  (8)  implies  that  c^O)  <  a(t)  Vi,  so  that  Vi  <  -ci(0)  ||  z  ||2.  Lemma  2 
(with  e  =  0)  then  permits  the  conclusion  that  e,  e,  w  converge  exponentially  to  zero.  □ 

Observe  that  in  the  preceding  stability  analysis  a  negative-definite  Lyapunov  function 
derivative  has  been  obtained.  It  will  be  shown  in  what  follows  that,  as  a  consequence, 
this  analysis  can  be  readily  extended  to  the  adaptive  case.  Note  also  that  the  stability 
properties  established  in  Lemma  3  are  semiglobal  in  the  sense  that  the  region  of  attraction 
can  be  increased  arbitrarily  by  increasing  the  controller  gain  7.  It  is  stressed  that  this  does 
not  imply  that  7  must  be  chosen  to  be  overly  large;  indeed,  we  have  found  that  excellent 
performance  is  obtainable  with  this  gain  set  to  quite  modest  values. 

3.2  Adaptive  Stabilization 

As  indicated  above,  it  is  desirable  to  eliminate  the  need  for  information  concerning  the 
potential  energy  model  in  mechanical  system  stabilization  schemes.  Consider  the  following 
adaptive  approach  to  realizing  this  objective: 

F  =  f  (t)  +  &i72w  +  &272e 

w  =  — 27W  +  72e  (9) 

f  =  /3(e  +  -j—e  -  -w) 

*17  7 

where  f(<)  €  is  the  adaptive  element  and  fi  is  a  positive  scalar  constant;  it  is  dear 
that  the  adaptive  term  f(i)  is  intended  to  compensate  for  potential  energy  effects.  Ob¬ 
serve  that  (9)  is  implementable  without  velocity  information  because,  although  w  and  f 
depends  on  e,  the  control  law  terms  w  and  f  depend  only  on  e  (as  can  be  verified  through 
direct  integration  of  the  w  and  f  equations).  Applying  the  control  law  (9)  to  the  nominal 
mechanical  system  dynamics  (3)  (with  d  =  0)  yields  the  closed-loop  error  dynamics 

He  +  Vcce  +  ki  t2w  +  k2j2e  +  $  +  G(xrf)  —  G(x)  =  0 

w  =  -27W  +  72e  (10) 

f  =  /3(e  +  -^_e  -  -w) 

«i7  7 

where  $  =  f  —  G(x<i). 

The  stability  properties  of  the  proposed  adaptive  stabilization  strategy  (9)  are  sum¬ 
marized  in  the  following  theorem. 

Theorem  1:  The  adaptive  scheme  (9)  ensures  that  origin  e  =  e  =  w  =  $  =  0isa 
semiglobally  stable  equilibrium  point  for  (10),  and  that  e,  e,  w  converge  to  zero  asvmp- 

'y  _  * 

totically,  provided  7  is  chosen  sufficiently  large. 
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Proof:  Let  U(x)  denote  the  potential  energy  of  the  mechanical  system  (3).  Consider  the 
Lyapunov  function  candidate 

V2  =  Vi  +  +  U(x)  -  U(xd)  +  GT(xd)e  (11) 

and  note  that  V2  is  a  positive-definite  and  radially-unbounded  function  of  e,  e,  w  and  <& 
if  7  is  chosen  sufficiently  large  (for  example,  if  7  is  chosen  large  enough  then  the  term 
k2'y2eTe/2  +  U(x)  —  U(xd)  +  GT(x<*)e  can  be  shown  to  be  positive-definite  in  e  using  an 
analysis  analogous  to  the  one  given  in  [18]). 

Computing  the  derivative  of  (11)  along  (10)  and  simplifying  as  in  the  proof  of  Lemma 
3  yields 

V2  <  -(7  -  II  e  ||2  ||  e  ||2  -ki7  II  w  ||2 

+  2A m„(H)  ||  w  HU  e  ||  ||  e  ||||  e  ||2  ||  w  ||||  e  ||2 

+  ^*T(f  -  /SI®  +  J^e-  (wlj+  II  G(xj)  -  G(x)  ||||  -  (w  || 

<  ~(7  -  £7)  II  «  II2  -(^  -  II  e  ||2  -h7  ||  w  ||2  (12) 

+  2 II  w  nil  e  II  II  w  nil  e  ||  + ||  e  1111  e  ||2  +%  ||  w  ||||  e  ||2 

7  fci  7  7 

where  M  is  a  positive  scalar  constant  satisfying  M  ||  xd  —  x  ||>||  G(x<f)  —  G(x)  ||  \/xd,x 
(the  boundedness  of  the  partial  derivatives  of  G  ensures  that  such  an  M  exists).  Define  z 
as  in  Lemma  3  and  Q2  as 

0  ——  1 

-X  max{H) 

^ max  (H)  h7  J 

and  note  that  Q2  is  positive-definite  if  7  is  chosen  large  enough.  Then  the  following  bound 
on  V2  in  (12)  can  be  obtained  through  routine  manipulation: 

V2  <  -(Ami„(Q2)  -  — V2112)  II  *  II2  (13) 

7 

where  a2  is  a  scalar  constant  which  does  not  increase  as  7  is  increased.  Let  c2(t)  = 
Xmin(Q2)  ~  «2^21/^2(i)/7  and  choose  7  large  enough  so  that  c2(0)  >  0.  Then  (13)  implies 
that  c2(0)  <  c2(t)  Vi,  so  that  V2  <  — c2( 0^  j|1z9||2.  Lemma  2  then  permits  the  conclusion 


Q2  = 


fc,-y 
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that  e=e  =  w  =  <&  =  Ois  (semiglobally)  stable  and  that  e,  e,  w  converge  to  zero 
asymptotically.  o 

Several  observations  can  be  made  concerning  the  adaptive  stabilization  strategy  (9). 
First,  the  preceding  analysis  demonstrates  that  this  scheme  ensures  that  the  nominal  me¬ 
chanical  system  dynamics  (3)  evolves  from  any  initial  state  to  any  desired  final  configura¬ 
tion  with  bounded  signals  and  with  convergence  of  the  state  error  to  zero.  The  controller 
requires  no  a  "priori  information  concerning  the  system  model  and  is  implementable  man; 
only  configuration  measurements.  Thus  the  proposed  scheme  provides  a  computationally 
efficient,  modular,  and  readily  implementable  solution  to  the  mechanical  system  stabiliza¬ 
tion  problem.  An  additional  observation  is  that  the  scalar  controller  gains  k\,  7,  and  /? 

can  be  replaced  with  the  appropriate  diagonal  matrices  for  increased  flexibility  and  perfor¬ 
mance;  scalar  gains  are  used  in  the  above  analysis  for  simplicity  of  development.  Finally, 
it  is  interesting  to  note  that  the  proposed  controller  (9)  can  be  readily  modified  to  provide 
increased  robustness  to  unmodeled  effects  and  external  disturbances;  this  modification  is 
considered  next. 

3.3  Robust  Adaptive  Stabilization 

Consider  the  following  modification  of  the  adaptive  stabilization  scheme  (9): 

F  =  f(i)  +  £j72w  +  fc2 72e 

w  =  — 27W  +  t2^  (14) 

f  -  -erf  +  £(e  4-  y^-e  -  iw) 

*17  7 

where  a  is  a  positive  scalar  constant.  Applying  the  controller  (14)  to  the  disturbed  me¬ 
chanical  system  dynamics  (3)  (with  d  nonzero)  yields  the  closed-loop  error  dynamics 

He  +  Vcce  +  &172  w  +  k2 72e  +  =  0 

w  =  —  27W  +  y*e  (15) 

f  =  -af  +  /3(e  +  y—e  —  -w) 

*i7  7 

where  ^  =  f  —  G  —  d. 

The  stability  properties  of  the  proposed  adaptive  stabilization  strategy  (14)  are  sum¬ 
marized  in  the  following  theorem. 

Theorem  2:  The  adaptive  scheme  (14)  ensures  that  e,e,w ,  are  globally  uniformly 
bounded  provided  that  7  is  chosen  sufficiently3farge.  Moreover,  the  stabilization  error  e_e 
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is  guaranteed  to  converge  exponentially  to  a  compact  set  which  can  be  made  arbitrarily 
small. 

Proof:  Consider  the  Lyapunov  function  candidate 


V3  =  v1  +  (16) 

and  note  that  V3  is  a  positive-definite  and  radially-unbounded  function  of  e,  e,  w  and  3*^ 
if  7  is  chosen  sufficiently  large.  Differentiating  (16)  along  (15)  and  simplifying  as  in  the 
proof  of  Lemma  3  yields 


y>  <  -(T  -  ^)Amj .(B)  II  e  f  -Sr  ||  e  |p  -kn  ||  w  f 

+  2A ma,(H)  ||  w  HU  e  ||  ||  e  mi  e  |p  +S  ||  w  ||||  *  |p 

+  i*J(f  -  G  -  d  -  /J(e  +  p-e  -  iw)) 

P  *17  7 


Defining  z  as  in  Lemma  3  and  Qz  as 


Qz  — 


Mi 

ki 

0  (2. Lu 

u  '  2  kn 

0  — Am, 


0 

)A  min(H) 

„«(#) 


0 

Amoz(LT) 

&i7 


(17) 


(and  noticing  that  Qz  is  positive-definite  if  7  is  chosen  large  enough)  permits  the  following 
bound  on  V3  in  (17)  to  be  obtained: 

vs  <  -(a -  ^n1/2)  11  *  ip  ~  n  ip 

-  | A ,min(H)  ||  e  |p  ||  *d  |p  +fi  ||  HU  e  ||  +J  ||  || 

<  -(A .„<«(«,)  -  ^fy1/2)  II  *  IP  ~  II  IP  +J  (18) 

where  the  at  are  scalar  constants  which  do  not  increase  as  7,  /?  increase. 

The  remainder  of  the  proof  will  utilize  the  result  given  in  Lemma.  1,  and  therefore  it 
is  convenient  to  bound  V3  in  (16)  in  the  following  manner: 


where  the  A;  are  positive  scalar  constants  independent  of  /?.  Now  choose  two  scalar  con¬ 
stants  Vm,  vm  so  that  Vm  >  vm  >  V3(0),2aruf define  cM  =  \min(Qz)  ~  <*3VmI/2/t;  then 
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choose  7  large  enough  so  that  cm  >  0  (this  is  always  possible).  Let  8  =  max( A3 /cm ,  2A2 / <x) 
and  choose  /9o  so  that  a^S/Po  <  Vm  (this  is  always  possible).  Then  selecting  ft  >  ft0  en¬ 
sures  that  if  Vm  <  V3  <  Vm  then  V3  <  0.  This  condition  together  with  Vm  >  Vm  >  14(0) 
implies  that  V3(t)  <  Vm  Vt,  so  that  c3(t)  =  Amin(Q3)  —  (XzV^^it)/^  >  cm  >  0  Vf  and 


V3  <  -cm  ||  z 


.  Il2 


where  A/9  =  ft  —  fto  and  it  is  assumed  that  /9  is  chosen  so  that  A/9  >  0.  Lemma  1  now 
applies  and  permits  the  conclusion  that  ||  z  ||,  ||  3?^  ||  are  uniformly  bounded,  which  implies 
that  e,  e,  w,  f  are  uniformly  bounded.  Moreover,  ||  z  ||,  |(  $*d  ||  converge  exponentially  to 
the  closed  balls  Bri ,  BT2 ,  respectively,  where 

=  (  foe  V/; 

\Ai(^o+A^)y 


Observe  that  the  radius  of  the  ball  to  which  ||  z  ||2=||  e  ||2  +  ||  e  ||2  +  ||  w  ||2  is  guaranteed 
to  converge  can  be  decreased  as  desired  simply  by  increasing  A/9.  □ 

Several  observations  can  be  made  concerning  the  adaptive  stabilization  strategy  (14). 
First,  the  preceding  analysis  demonstrates  that  this  scheme  ensures  that  the  disturbed, 
mechanical  system  dynamics  (3)  evolves  from  any  initial  state  to  any  desired  final  config¬ 
uration  with  uniform  boundedness  of  all  signals  and  with  exponential  convergence  of  the 
state  error  to  an  arbitrarily  small  neighborhood  of  the  origin.  Note  that  the  ultimate  size 
of  the  regulation  error  can  be  reduced  as  desired  simply  by  increasing  a  single  controller 
parameter  (the  adaptation  gain  /9),  and  that  exponential  convergence  ensures  that  the 
transient  behavior  of  the  errors  is  well-behaved.  The  controller  does  not  require  knowledge 
of  the  structure  or  parameter  values  for  the  sytem  model,  and  is  implementable  using 
only  configuration  measurements.  Thus  the  proposed  scheme  provides  a  computationally 
efficient,  modular,  readily  implementable,  and  robust  solution  to  the  mechanical  system 
stabilization  problem.  An  additional  observation  is  that  the  scalar  controller  gains  Aq,  &2, 
7,  <7,  and  /9  can  be  replaced  with  the  appropriate  diagonal  matrices  for  increased  flexibility 
and  performance;  scalar  gains  are  used  in  the  above  analysis  for  simplicity  of  development. 


4.  Case  Studies:  Simulation  Results 

This  Section  describes  the  results  of  a  computer  simulation  study  involving  the  appli- 

2  - 132 

cation  of  the  proposed  adaptive  stabilization  strategy  (14)  to  two  important  machine  tool 
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control  problems:  cutting  force  control  in  milling  and  contouring  control  in  machining.  In 
each  of  these  applications  the  plant  dynamics  can  be  modeled  in  the  general  form  (2), (3), 
and  in  each  case  there  is  considerable  motivation  for  seeking  a  stabilization  scheme  which 
does  not  require  rate  measurements  or  specific  model  information.  Throughout  the  study 
the  unit  of  time  is  second,  the  unit  of  force  is  Newton,  and  the  unit  of  length  is  millimeter, 
unless  stated  otherwise. 

4.1  Cutting  Force  Control  in  Milling 

It  has  been  recognized  that  adaptively  controlling  the  cutting  forces  and  torques  en¬ 
countered  during  machining  operations  could  permit  material  removal  rates  to  be  increased 
while  maintaining  safe  machining  conditions  [e.g,  28,29].  Previous  approaches  to  designing 
adaptive  controllers  for  this  application  have  usually  been  based  on  linearized  models  of 
the  milhng  process  and  have  often  assumed  that  force  rate  information  is  available  (which 
is  not  realistic).  As  a  consequence,  there  has  been  concern  regarding  the  stability  of  the 
schemes  and  the  application  of  these  control  systems  has  been  limited.  In  this  example 
we  consider  the  problem  of  controlling  the  cutting  force  in  milling  by  manipulating  the 
feedrate  override  voltage  using  only  measurements  of  cutting  force.  The  work  reported  in 
[28,29]  shows  that  the  plant  associated  with  this  control  problem  is  a  mechanical  system 
of  the  general  form  (2), (3)  with  m  =  n  =  1;  the  precise  form  of  the  model  is  strongly 
dependent  on  the  machine  tool  and  workpiece  and  it  is  usually  not  realistic  to  assume  that 
this  model  is  known  with  any  precision. 

This  simulation  examines  the  capability  of  the  proposed  stabilization  scheme  (14)  to 
accurately  and  robustly  control  cutting  force  in  machining.  The  simulation  utilizes  the 
cutting  force  model  developed  in  [28,29],  and  all  model  and  task  parameters  are  as  given 
in  [29].  It  is  stressed  that  this  model  information  is  used  only  for  computer  simulation  of 
the  system  and  is  not  made  available  to  the  controller.  The  cutting  tool  is  initially  not  in 
contact  with  the  workpiece,  and  the  tool  is  requested  to  exert  a  cutting  force  of  500N  on 
the  workpiece;  thus  the  cutting  tool  is  driven  to  the  workpiece  under  the  control  of  the 
proposed  stabilization  scheme.  Then,  midway  through  the  cut  (at  t  =  5.0s),  the  depth  of 
cut  is  suddenly  reduced  by  a  factor  of  four,  causing  an  abrupt  change  in  the  system  model 
(which  is  sensitive  to  changes  in  depth  of  cut).  The  adaptive  gain  f  is  initialized  to  zero,  and 
the  remaining  controller  parameters  are  set  as  follows:  k\  =  10,  k2  =  10,  7  =  5 ,  <t  =  0.01, 
0  =  1000.  The  adaptive  controller  is  applied  to  the  milling  machine  with  a  sampling  period 
of  two  milliseconds,  and  all  integrations  required  by  the  controller  are  implemented  using 
a  simple  trapezoidal  integration  rule  with  a  time-step  of  two  milliseconds.  The  results  of 
this  simulation  are  given  in  Figure  1,  anc^in^ate  that  accurate  cutting  force  regulation 
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is  achieved  with  the  proposed  scheme  (14).  Additionally,  this  simulation  shows  that  the 
controller  is  robust  to  the  abrupt  and  unexpected  change  in  the  system  dynamics  which 
occurs  when  the  depth  of  cut  is  suddenly  reduced  midway  through  the  task. 

4.2  Contouring  with  Machine  Tools 

The  contouring  problem  for  multiaxis  machine  tools  involves  controlling  the  motion 
of  the  cutting  tool  relative  to  the  workpiece  so  that  the  tool  accurately  tracks  a  given 
trajectory  [e.g.,  30,31].  The  controllers  which  are  presently  employed  for  this  application 
are  typically  designed  using  classical  techniques  under  the  assumption  that  the  system 
is  linear,  and  one  drawback  with  this  approach  is  that  tracking  accuracy  degrades  if  the 
desired  trajectory  requires  fast  motions  or  involves  tight  contours.  Another  potential 
problem  with  these  schemes  is  their  reliance  on  velocity  information,  since  measurements 
of  velocity  are  often  contaminated  with  noise  [12].  Here  we  consider  implementing  the 
proposed  stabilization  scheme  (14)  for  contouring  control  with  a  two-axis  milling  machine. 
The  investigations  [30,31]  show  that  the  plant  associated  with  this  control  problem  is  a 
mechanical  system  of  the  general  form  (2), (3)  with  m  =  n  =  2;  the  details  of  the  model 
depend  on  the  machine  tool  and  workpiece  and  are  not  likely  to  be  known  with  any 
precision. 

This  simulation  examines  the  capability  of  the  proposed  stabilization  scheme  (14)  to 
accurately  track  a  given  contouring  trajectory  with  a  two-axis  milling  machine.  Note 
that,  although  the  controller  (14)  is  not  designed  for  tracking,  it  can  be  expected  that 
the  adaptive  nature  of  the  scheme  should  permit  good  tracking  accuracy  to  be  obtained. 
This  simulation  utilizes  the  machine  tool  model  presented  in  [31],  and  all  model  and  task 
parameters  are  as  given  in  [31].  Again,  it  is  stressed  that  this  model  information  is  used 
only  to  simulate  the  dynamical  system  and  is  not  made  available  to  the  controller.  As 
indicated  above,  a  potential  advantage  of  this  approach  to  mechanical  system  control  is 
that  velocity  information,  which  is  typically  noisy,  is  not  required  by  the  controller.  To 
examine  this  potential,  the  proposed  scheme  (14)  is  implemented  both  with  and  without 
velocity  measurements  (in  the  case  that  velocity  is  available,  this  information  is  simply 
used  in  place  of  w  in  (14)).  The  velocity  •  information  is  modeled  as  a  superposition  of 
the  true  velocity  and  a  measurement  error  component,  where  the  error  is  assumed  to  be  a 
zero-mean  Gaussian  signal  with  a  standard  deviation  equal  to  one  percent  of  the  nominal 
signal;  thus  the  measurements  are  modeled  as  being  relatively  free  of  error. 

In  this  simulation,  the  milling  machine  is  commanded  to  track  a  circular  contour  of 
radius  100mm  in  10.0s,  and  to  initiate  and  terminate  the  contouring  with  zero  velocity. 
The  adaptive  gain  f  is  initialized  to  zero^^jl  the  remaining  controller  parameters  are 
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set  as  follows:  =  1000,  k2  =  1000,  7  =  5,  a  =  0.001,  /?  =  10000.  The  adaptive 

controller  is  applied  to  the  milling  machine  with  a  sampling  period  of  two  milliseconds, 
and  all  integrations  required  by  the  controller  axe  implemented  using  a  simple  trapezoidal 
integration  rule  with  a  time-step  of  two  milliseconds.  The  results  of  this  simulation  are 
given  in  Figure  2,  and  indicate  that  accurate  trajectory  tracking  is  achieved  with  the 
proposed  scheme  (14).  Additionally,  this  simulation  shows  that  the  controller  is  capable  of 
providing  performance  superior  to  that  obtainable  with  a  state-feedback  adaptive  controller 
if  there  is  even  a  modest  level  of  noise  in  the  velocity  measurements. 

5.  Case  Studies:  Experimental  Results 

The  proposed  adaptive  stabilization  scheme  (14)  is  now  applied  to  an  industrial  robot 
in  a  series  of  hardware  experiments.  The  system  chosen  for  this  study  is  the  IMI  Zebra 
Zero  manipulator  (see  Figure  3).  This  robot  is  an  electrically-driven  six  DOF  revolute  joint 
arm,  and  is  composed  of  a  trunk,  shoulder,  upper  arm,  forearm,  wrist,  and  end-effector. 
The  selection  of  this  type  of  system  for  study  is  motivated  by  the  fact  that  robots  form 
an  important  and  well-known  class  of  mechanical  systems,  and  that  there  is  a  recognized 
need  for  high-performance  control  of  these  systems  in  the  presence  of  uncertainty  and 
disturbances.  In  what  follows  some  familiarity  with  robots  is  assumed,  and  the  interested 
reader  is  referred  to  [32]  for  additional  details  regarding  these  systems.  Throughout  the 
study  the  unit  of  time  is  second  and  the  unit  of  angle  is  degree. 

5.1  Position  Regulation  of  Industrial  Robots 

The  adaptive  stabilization  strategy  (14)  is  now  applied  to  an  industrial  robot  in  an 
experimental  investigation.  The  experimental  facility  utilized  for  this  study  is  the  New 
Mexico  State  University  Robotics  Laboratory  and  consists  of  a  IMI  Zebra  Zero  robot 
arm  and  the  associated  control  computer  (see  Figure  3).  The  Zebra  robot  possesses  a 
conventional  design  with  six  revolute  joints  configured  in  a  “waist-shoulder-elbow- wrist” 
arrangement,  and  the  system  dynamic  model  is  of  the  general  form  (2), (3)  with  m  = 
n  =  6;  the  precise  form  of  the  model  depends  on  the  particular  choice  of  generalized 
coordinates.  All  control  software  is  written  in  ‘C’  and  is  hosted  on  an  IBM-compatible 
486  personal  computer.  A  complete  description  of  this  flexible  and  modular  experimental 
testbed  is  beyond  the  scope  of  this  paper;  the  interested  reader  is  referred  to  [33]  for  such 
a  description.  It  is  noted,  however,  that  the  open  architecture  environment  is  designed  to 
allow  convenient  implementation  and  experimental  evaluation  of  advanced  control  schemes 
using  the  Zebra  robot,  and  therefore  provides  an  ideal  testbed  for  the  present  investigation. 
The  first  experiment  in  this  case  stu(|y_  f^nonst rates  the  capability  of  the  proposed 
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adaptive  scheme  (14)  to  provide  accurate  position  regulation.  The  Zebra  manipulator 
is  initially  at  rest  with  joint-space  position  0(0)  =  [— 45°,0°,  — 100o,0°,45o,0°]T,  and  is 
commanded  to  move  to  the  desired  final  configuration  6d  =  [15°,  60°,  —40°,  60°,  105°,  60o]T. 
Thus  each  joint  is  requested  to  move  through  60°  and  to  initiate  and  terminate  the  motion 
with  zero  velocity.  The  adaptive  gain  f  is  initialized  to  zero,  and  the  remaining  controller 
parameters  are  set  as  follows:  ki  =  10,  =  20,  7  =  5,  a  —  0.001,  ft  =  100.  The 

adaptive  controller  is  applied  to  the  robot  with  a  sampling  period  of  seven  milliseconds, 
and  all  integrations  required  by  the  controller  are  implemented  using  a  simple  trapezoidal 
integration  rule  with  a  time-step  of  seven  milliseconds.  The  results  of  this  experiment  are 
given  in  Figures  4a  and  4b,  and  indicate  that  accurate  position  regulation  is  achieved  with 
the  proposed  scheme  (14). 

5.2  Trajectory  Tracking  of  Industrial  Robots 

In  the  next  experiment,  the  adaptive  stabilization  scheme  (14)  is  applied  to  a  robot 
trajectory  tracking  problem.  While  utilizing  stabilization  control  systems  for  tracking  is 
not  new  in  robotics,  this  experiment  indicates  that  the  capacity  for  adaptation  can  permit 
even  rapid  trajectories  to  be  tracked  with  good  accuracy.  The  Zebra  manipulator  is  again 
initially  at  rest  with  the  same  joint-space  position  as  in  the  first  experiment.  The  waist  joint 
is  commanded  to  smoothly  move  from  its  initial  position  of  8\  =  —45°  to  an  intermediate 
position  of  8\  =  45°,  and  then  to  return  to  its  initial  position;  the  desired  temporal 
trajectory  for  this  motion  is  specified  to  be  6\d  —  —45°  +  45°(1  —  cos(7r/2)t)  for  t  €  [0,4]. 
At  the  same  time,  the  shoulder  joint  is  commanded  to  move  from  02  =  0°  to  02  =  57.3°  (1 
radian)  and  then  back  to  02  =  0°  according  to  the  trajectory  8 2d  =  28.7°(1  —  cos(7r/2)f) 
for  t  €  [0,4].  All  of  the  other  manipulator  joints  are  commanded  to  remain  in  their 
initial  positions.  Thus  the  manipulator  is  required  to  track  a  large  and  rapid  joint-space 
trajectory  with  no  a  priori  information  concerning  the  system  model.  The  control  strategy 
(14)  is  utilized  to  achieve  the  specified  trajectory  tracking,  and  the  sampling  period  and 
all  controller  parameters  are  set  to  the  values  used  in  the  first  experiment.  The  results  of 
this  experiment  are  given  in  Figures  5a  and  5b,  and  show  that  accurate  trajectory  tracking 
is  achieved. 

6.  Conclusions 

This  paper  introduces  a  new  adaptive  stabilization  scheme  for  mechanical  systems 
that  eliminates  the  need  for  any  knowledge  of  the  system  dynamic  model  and  that  can  be 
implemented  using  only  measurements  of  the  system  configuration.  The  control  strategy  is 
computationally  efficient,  modular,  and  repdj^g  implementable  with  a  wide  variety  of  sys- 
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terns.  It  is  demonstrated  through  computer  simulations  and  experiments  that  the  proposed 
controller  can  be  applied  to  manufacturing  tasks  of  practical  importance,  including  cutting 
force  control  in  machining,  control  of  contouring  with  a  machine  tool,  and  position  control 
of  an  industrial  robot.  In  the  cutting  force  control  example,  the  scheme  is  applied  to  obtain 
accurate  and  robust  force  control  using  only  force  measurements  and  without  any  a  priori 
information  about  the  machine  tool  or  workpiece.  The  applications  involving  the  machine 
tool  and  the  robot  illustrate  that  the  scheme  can  be  used  for  both  position  regulation  and 
trajectory  tracking,  and  confirm  that  accurate  motion  control  can  be  achieved  without 
model  information  or  rate  measurements.  Future  research  will  focus  on  the  application  of 
the  proposed  approach  to  the  problem  of  controlling  constrained  mechanical  systems. 

7.  Acknowledgments 

The  research  described  in  this  paper  was  supported  in  part  through  contracts  with  the 
Army  Research  Office  and  Sandia  National  Laboratories. 


8.  References 

1.  Marino,  R.,  “Stabilization  and  Feedback  Equivalence  to  Linear  Coupled  Oscillators”, 
International  Journal  of  Control ,  Vol.  39,  1984,  pp.  487-496 

2.  Schaft,  A.  van  der,  “Stabilization  of  Hamiltonian  Systems”,  Nonlinear  Analysis:  The¬ 
ory,  Methods,  and  Applications ,  Vol.  10,  No.  10,  1986,  pp.  1021-1035 

3.  Koditschek,  D.,  “The  Control  of  Natural  Motion  in  Mechanical  Systems”,  ASME 
Journal  of  Dynamic  Systems,  Measurement,  and  Control ,  Vol.  113,  No.  4,  1991,  pp. 
547-551 

4.  Aeyels,  D.,  “On  Stabilization  by  Means  of  the  Energy-Casimir  Method”,  Systems  and 
Control  Letters ,  Vol.  18,  1992,  pp.  325-328 

5.  Bloch,  A.,  M.  Reyhanoglu,  and  N.H.  McClamroch,  “Control  and  Stabilization  of  Non- 
holonomic  Dynamic  Systems”,  IEEE  Transactions  on  Automatic  Control  Vol.  37,  No. 
11,  1992,  pp.  1746-1757 

6.  Abesser,  H.  and  M.  Katzschmann,  “Structure-Preserving  Stabilization  of  Hamiltonian 
Control  Systems”,  Systems  and  Control  Letters ,  Vol.  22,  1994,  pp.  281-285 

7.  Takegaki,  M.  and  S.  Arimoto,  “A  New  Feedback  Method  for  Dynamic  Control  of 
Manipulators”,  ASME  Journal  of  Dynamic  Systems,  Measurement,  and  Control,  Vol. 
102,  1981,  pp.  119-125 

8.  Paden,  B.  and  R.  Panja,  “Globally  Asymptotically  Stable  ‘PD+’  Controller  for  Robot 
Manipulators” ,  International  Journal  of  Control,  Vol.  47,  No.  6,  1988,  pp.  1697-1712 

9.  McClamroch,  N.H.  and  D.  Wang,  “Feedback  Stabilization  and  Tracking  of  Constrained 
Robots”,  IEEE  Transactions  on  Automatic  Control,  Vol.  33,  No.  5,  1988,  pp.  419-426 

10.  Crouch,  P.,  “Spacecraft  Attitude  Control  and  Stabilization:  Application  of  Geometri¬ 

cal  Control  Theory  to  Rigid  Body  Models”,  JEFF  Transactions  on  Automatic  Control, 
Vol.  29,  No.  4,  1984,  pp.  321-331  2  ‘ 137 


17 


11.  Byrnes,  C.  and  A.  Isidori,  “On  the  Attitude  Stabilization  of  Rigid  Spacecraft”,  Auto- 
matica,  Vol.  27,  No.  1,  1991,  pp.  87-95 

12.  Klafterm,  R.,  T.  Chmielewski,  and  M.  Negin,  Robotic  Engineering:  An  Integrated 
Approach,  Prentice  Hall,  Englewood  Cliffs,  NJ,  1989 

13.  Nicosia,  S.  and  P.  Tomei,  “Robot  Control  by  Using  Only  Joint  Position  Measure¬ 
ments”,  IEEE  Transactions  on  Automatic  Control.  Vol.  35,  No.  9.  1990  pp  1058- 
1061 

14.  Canudas  de  Wit,  C.  and  N.  Fixot,  “Robot  Control  via  Robust  Estimated  State  Feed¬ 
back”,  IEEE  Transactions  on  Automatic  Control,  Vol.  36,  No.  12, 1991,  pp.  1497-1501 

15.  Berghuis,  H.  and  H.  Nijmeijer,  “A  Passivity  Approach  to  Controller-Observer  Design 
for  Robots”  IEEE  Transactions  on  Robotics  and  Automation,  Vol.  9.  No.  6  1993  pp 
740-754 

16.  Berghuis,  H.  and  H.  Nijmeijer,  “Global  Regulation  of  Robots  Using  Only  Position 
Measurements”  Systems  and  Control  Letters,  Vol.  21,  1993,  pp.  289-293 

17.  Feng,  W.  and  I.  Postlethwaite,  “A  Simple  Robust  Control  Scheme  for  Robot  Manip¬ 
ulators  With  Only  Joint  Position  Measurements”,  International  Journal  of  Robotics 
Research,  Vol.  12,  No.  5,  1993,  pp.  490-496 

18.  Tomei,  P.,  “Adaptive  PD  Controller  for  Robot  Manipulators”,  IEEE  Transactions  on 
Robotics  and  Automation ,  Vol.  7,  No.  4,  1991,  pp.  565-570 

19.  Kelly,  R.,  “Comments  on  ‘Adaptive  PD  Controller  for  Robot  Manipulators’  ”,  IEEE 
Transactions  on  Robotics  and  Automation,  Vol.  9,  No.  1,  1993,  pp.  117-119 

20.  Colbaugh,  R.,  “Adaptive  Point-to-Point  Motion  Control  of  Manipulators”,  Interna¬ 
tional  Journal  of  Robotics  and  Automation,  Vol.  9,  No.  2,  1994,  pp.  51-61 

21.  Saletan,  E.  and  A.  Cromer,  Theoretical  Mechanics,  Wiley,  New  York,  1971 

22.  Colbaugh,  R.  and  K.  Glass,  “On  Controlling  Robots  with  Redundancy” ,  Robotics  and 
Computer -Integrated  Manufacturing:  An  International  Journal,  Vol.  9,  No.  2,  1992, 
pp.  121-135 

23.  Colbaugh,  R.  and  K.  Glass,  “Hierarchical  Control  of  Human  Joint  Motion  Simulators”, 
Computers  and  Electrical  Engineering:  An  International  Journal,  VoL  19,  No.  3, 1993, 
pp.  213-230 

24.  Colbaugh,  R.,  K.  Glass,  and  P.  Pittman,  “Adaptive  Control  for  a  Class  of  Hamiltonian 
Systems”,  Computers  and  Electrical  Engineering:  An  International  Journal,  Vol.  20, 
No.  1,  1994,  pp.  21-38 

25.  Corless,  M.,  “Guaranteed  Rates  of  Exponential  Convergence  for  Uncertain  Systems”, 
Journal  of  Optimization  Theory  and  Applications,  Vol.  64,  No.  3,  1990,  pp.  481-494 

26.  Colbaugh,  R.,  H.  Seraji,  and  K.  Glass,  “Adaptive  Compliant  Motion  Control  for  Dex¬ 
terous  Manipulators”,  International  Journal  of  Robotics  Research.  Vol.  14  No.  3, 
1995,  pp.  270-280 

27.  Vidyasagax,  M.,  Nonlinear  Systems  Analysis,  Second  Edition,  Prentice  Hall,  Engle¬ 
wood  Cliffs,  NJ,  1993 

28.  Lauderbaugh,  L.  and  A.  Ulsoy,  “Model  Reference  Adaptive  Control  in  Milling” ,  ASME 
Journal  of  Engineering  for  Industry,  Vol.  Ill,  No.  1,  1989,  pp.  13-21 

29.  Sullivan,  G.,  “Adaptive  Control  witB  -ah^xpert  System  Based  Supervisory  Level”, 


18 


CIRSSE  Report  115,  Rensselaer  Polytechnic  Institute,  1991 

30.  Chuang,  H.  and  C.  Liu,  “Cross-Coupled  Adaptive  Feedrate  Control  for  Multiaxis  Ma¬ 
chine  Tools”,  ASME  Journal  of  Dynamic  Systems ,  Measurement,  and  Control,  Vol. 
113,  No.  3,  1991,  pp.  451-457 

31.  Lo,  C.-C.  and  Y.  Koren,  “Evaluation  of  Servo-Controllers  for  Machine  Tools”,  Proc. 
American  Control  Conference ,  Chicago,  IL,  June  1992 

32.  Spong,  M.  and  M.  Vidyasagar,  Robot  Dynamics  and  Control,  Wiley,  New  York,  1989 

33.  Glass,  K.  and  R.  Colbaugh,  “A  Robust  Open- Architecture  Experimental  Testbed  for 
Manipulator  Controller  Development”,  Robotics  Laboratory  Report  94-01,  New  Mex¬ 
ico  State  University,  February  1994 


2-139 


19 
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Figure  1:  Response  of  cutting  force  in  milling  simulation  study 


Figure  2:  Tracking  errors  for  proposed  controller  (dashed)  and  state-feedback  controller 
(solid)  in  machine  tool  contouring  simulation  study 
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Figure  3:  Zebra  Robot  Manipulator 
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Figure  4a: 


Response  of  Zebra  robot  joint  position  error  norm  (proximal  three  joints)  in 
experimental  study 


Figure  4b:  Response  of  Zebra  robot  joint  position  error  norm  (distal  three  joints)  in 
experimental  study 
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ABSTRACT 


In  high-precision  machining,  error  involving  thermal  expansion  of  the  machine  body  is  a 
major  concern.  In  order  to  counteract  the  displacement  of  the  machining  tool  due  to  thermal 
expansion  of  the  machine  body,  a  relationship  between  the  measured  temperature  distribution  of 
the  machine  and  the  displacement  of  the  tool  must  be  determined.  One  possible  solution  to  this 
problem  is  to  use  a  neural  network  to  estimate  the  displacement  of  the  tool  in  real-time  given  the 
measured  temperature  distribution.  A  feasibility  study  was  performed  using  ANSYS  to  generate 
three  time-variant  temperature  distributions  of  the  machine  body  given  three  different  thermal 
loadings.  ANSYS  was  also  used  to  determine  the  resulting  machine  tool  displacements.  A 
backpropagation  neural  network  was  then  used  to  estimate  the  displacement  of  the  machine  tool 
given  each  of  the  temperature  distributions  generated  by  ANSYS.  Finally,  a  larger  neural 
network  was  used  to  encompass  the  displacements  due  to  all  three  thermal  loading  situations. 
Evaluation  of  the  performance  of  each  neural  network  was  based  on  a  comparison  between  the 
tool  displacement  results  of  ANSYS  and  the  neural  network.  Due  to  small  errors  between  the 
neural  network  estimations  and  the  "actual"  tool  displacement  results  from  ANSYS,  it  was 
determined  that  neural  network  control  of  thermal  expansion  of  a  machine  body  is  feasible  and 
warrants  further  study. 
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INTRODUCTION 


High-precision  machining  is  an  important  part  in  improved  quality  and  performance  in 
manufacturing.  In  high-precision  machining,  tolerances  are  kept  low  improving  part  quality,  ease 
of  assembly,  and  enabling  the  production  of  more  complex,  high-performance  designs.  In 
addition,  time-consuming  finishing  processes  such  as  grinding  and  polishing  may  be  reduced  or 
eliminated.  However,  errors  due  to  backlash,  misalignment,  wear,  and  thermal  expansion  create 
difficulties  in  reducing  part  tolerances  to  very  low  levels.  Static  geometric  errors  such  as 
assembly  and  installation  misalignments,  machine  wear,  and  backlash  are  relatively  easy  to 
compensate  for  and  many  CNC  controllers  have  built-in  compensation  for  these  effects  [2]. 
However,  thermal  expansion  error,  which  may  account  for  40-70%  of  dimensional  workpiece 
error  [2],  is  much  more  difficult  to  handle. 

Thermal  errors  are  dynamic  errors  which  warrant  real-time  compensation  methods.  These 
errors  are  caused  by  thermal  expansion  and  distortion  of  machine  components  due  to  heat  sources 
such  as  motors,  bearings,  hydraulic  systems,  increasing  room  temperature,  and  sunshine.  Thermal 
expansion  and  distortion  of  the  machine  body  can  account  for  positioning  errors  on  the  order  of 
100  micrometers  [2]. 

For  real-time  compensation  of  the  thermal  expansion  of  the  machine  body,  a  relationship 
between  the  measured  temperature  distribution  of  the  machine  body  and  the  displacement  of  the 
tool  location  must  be  determined.  Due  to  the  highly  non-linear  behavior  of  thermal  error,  a 
prediction  of  tool  displacement  using  traditional  heat  transfer  analysis  is  extremely  difficult. 
Therefore,  other  methods  of  solution  must  be  examined.  Non-linear  adaptability  and  fast 
execution  time  make  neural  networks  a  good  candidate  for  the  solution  to  this  problem. 

The  objective  of  this  study  is  to  asses  the  feasibility  of  using  neural  networks  to  estimate 
tool  displacement  due  to  thermal  expansion  of  a  manufacturing  machine  body.  This  estimation 
may  then  be  used  for  real-time  compensation  of  this  kind  of  thermal  error.  For  simplicity  the 
machine  body  is  modeled  as  a  simple  two-dimensional  solid  which  is  subjected  to  three  types  of 
thermal  loadings.  A  finite  element  package,  ANSYS,  is  used  to  generate  three  time-variant 
temperature  distributions  and  the  resulting  tool  displacement  trajectory  for  each  thermal  load  case. 
Three  similar  neural  networks  were  then  developed  to  estimate  the  machine  tool  displacement 
given  a  set  of  discrete  temperature  values  from  the  three  different  temperature  profiles.  The 
performance  of  each  neural  network  was  evaluated  separately  based  on  a  comparison  between  the 
neural  network  estimation  and  the  tool  displacement  results  from  ANSYS.  A  comparison  is  then 
made  to  determine  which  case  of  thermal  expansion  is  most  difficult  for  a  neural  network  to 
estimate.  Finally,  the  three  thermal  loading  schemes  are  combined  and  a  single  neural  network  is 
developed  to  estimate  the  tool  displacement  due  to  any  of  the  thermal  loading  schemes.  From  the 
performance  of  this  fairly  general  neural  network,  the  feasibility  of  using  neural  networks  to 
estimate  tool  displacement  due  to  thermal  expansion  of  a  machine  body  is  assessed. 
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PROBLEM  FORMULATION 


The  first  step  in  the  approach  to  this  feasibility  study  is  to  model  the  problem  in  a 
simplified  but  meaningful  manner.  This  involves  defining  a  simplified  geometry  of  the 
manufacturing  machine  body,  defining  thermal  and  structural  boundary  conditions,  and  defining 
three  simple  but  fairly  realistic  thermal  loading  schemes.  See  Figure  #1  for  a  schematic  of  the 
machine  model  used. 


FIGURE  #  1:  Schematic  of  Machine  Body 


Geometry  and  Composition 

The  machine  body  is  modeled  as  a  two-dimensional  solid  shaped  like  a  C-shaped  machine 
frame  without  the  cutting  and  positioning  table.  Refer  to  Figure  #1  for  dimensions  of  machine 
model  and  location  of  machine  tool.  It  is  assumed  the  machine  is  made  of  plain  carbon  steel  with 
the  following  properties: 

Density  =  7.854  x  10'3  kg/m3  [3], 

Specific  heat  =  434  J/kgK  [3], 
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Thermal  Conductivity  =  60.5  W/mK  [3], 

Modulus  of  elasticity  =  200  GPa  [1], 

Poisson's  ratio  =  0.29  [1],  and 

Thermal  expansion  coefficient  =  1 1.7  x  10"6  /°C  [1]. 


Boundary  Conditions 

Structurally,  the  machine  is  modeled  to  be  supported  by  a  pin  and  roller  connection  on  the 
floor.  At  the  floor,  the  machine  is  also  assumed  to  be  insulated.  The  ambient  or  room 
temperature  is  assumed  to  be  at  a  constant  temperature  of  20°  C.  Initially,  the  system  is  assumed 
to  be  at  steady  state  with  the  machine  at  an  initial  temperature  of  20°  C.  Also  it  is  assumed  there 
are  no  external  forces  acting  on  the  system  other  than  reactions  at  the  floor. 

Determination  of  Convection  Coefficients 


The  machine  is  assumed  to  be  exposed  to  convection  on  all  sides  except  the  floor  surface. 
The  convection  coefficients  at  these  surfaces  are  estimated  using  empirical  correlations  [3].  See 
Appendix  A  for  calculations.  The  top  surface  and  all  vertical  surfaces  are  associated  with  a 
convection  coefficient  of  5  W/m2^  The  bottom  exposed  surface,  however,  is  associated  with  a 
smaller  convection  coefficient  of  2  W/m^. 

Thermal  Load  Cases 


The  following  thermal  load  cases  are  simple  but  each  represent  a  fairly  realistic  thermal 
loading  of  a  manufacturing  machine.  Figure  #  2  shows  the  schematic  of  each  thermal  loading 
used  in  this  study. 


THERMAL  LOADING#  1 


QX  -  200  W/m 


THERMAL  LOADING  #2 


Q2 

TTT 


THERMAL  LOADING  #3 


Q3  =  50,000  W/m2 
04=241  W 


FIGURE  #  2:  Thermal  Load  Case  Schematics 
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The  first  thermal  load  is  the  heating  of  one  side  of  the  machine  due  to  sunlight  or  heat  sources 
such  a  motor  or  hydraulic  system.  The  second  thermal  load  is  the  uniform  heating  of  all  sides 
which  may  be  due  to  an  effect  such  as  increasing  room  temperature.  The  third  thermal  load 
scheme  includes  internal  heat  generation  due  to  a  heat  source  such  as  an  internal  motor  and  a 
point  heat  source  at  the  tool  location  due  to  heating  of  the  tool  during  cutting.  The  heat  source, 
heat  flux,  and  heat  generation  values  used  in  this  study  are  strictly  estimations  and  do  not 
necessarily  represent  actual  values. 

TEMPERATURE  PROFILES  AND  ACTUAL  TOOL  DISPLACEMENTS 

The  finite  element  software  package,  ANSYS,  is  used  to  take  the  thermal  loading  and 
boundary  conditions  to  calculate  the  time-variant  temperature  profile  for  each  of  the  thermal  load 
cases.  After  this  is  completed,  it  is  possible  for  ANSYS  to  use  these  temperature  profile  results 
and  determine  the  displacement  trajectory  of  the  tool  location.  The  displacement  trajectories 
generated  by  ANSYS  are  considered  to  be  actual  machine  tool  displacements.  In,  addition, 
"measurements"  of  machine  temperature  are  taken  to  be  discrete  temperature  values  from  the 
time-variant  temperature  profiles  at  selected  locations  on  the  outer  surfaces  of  the  machine.  The 
finite  element  model  consists  of 342  elements  and  385  nodes.  All  nodes  are  uniformly  spaced  1 
inch  apart  so  that  the  size  of  each  element  is  1  inch  x  1  inch.  Appendix  B  contains  a  complete 
ANSYS  program  file  used  to  calculate  the  time-variant  temperature  profile  and  resulting 
displacement  trajectory  of  the  tool  location  for  the  first  thermal  load  case.  Appendix  C  and 
Appendix  D  contain  the  changes  needed  for  the  second  and  third  thermal  loadings,  respectively. 

Generation  of  Time-Variant  Temperature  Profiles 

To  generate  a  time-variant  temperature  profile  in  ANSYS  a  transient  thermal  analysis  must 
be  performed  which  uses  the  defined  thermal  loading,  boundary  conditions,  initial  condition, 
convection  coefficients,  and  material  properties.  The  total  time  span  analyzed  is  2  hours  which 
must  be  broken  into  small  increments  for  solution  accuracy.  At  every  time  increment,  a 
temperature  distribution  of  the  model  is  calculated.  It  is  expected  that  most  dynamic  effects  occur 
when  a  body  is  first  subjected  to  a  thermal  load.  Therefore,  solution  increments  should  be  smaller 
at  first.  See  Table  #1  for  the  time  increments  used  in  this  analysis. 


Time  Interval 
0-10  min 
10-20  min 
20-40  min 
40-120  min 


Increment  Size 
30  sec 

1  min 

2  min 
4  min 


TABLE  #  1:  Thermal  Solution  Increment  Size 
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After  the  thermal  solution  is  complete,  an  entire  temperature  profile  of  the  model  is 
determined  at  each  solution  increment.  From  this  data,  a  temperature  history  of  selected  nodes 
can  be  extracted  to  be  used  as  temperature  "measurement"  inputs  to  the  neural  networks. 

Determination  of  Actual  Tool  Displacements 

After  the  thermal  solution  is  completed,  ANSYS  can  use  the  temperature  profile  results, 
structural  boundary  conditions,  and  material  properties  to  determine  the  displacement  trajectory 
of  the  tool  location.  To  do  this,  a  static  structural  analysis  is  performed  to  compute  the  thermal 
expansion  of  the  finite  element  model  at  desired  time  steps.  ANSYS  is  then  used  to  determine  the 
x  and  y  displacement  components  of  the  model  at  the  tool  location.  This  information  is  used  to 
generate  the  displacement  trajectory  of  the  machine  tool.  In  this  analysis,  the  x  and  y 
displacement  components  of  the  tool  location  are  computed  every  2  minutes  for  the  first  40 
minutes  and  every  4  minutes  thereafter. 

DEVELOPMENT  OF  NEURAL  NETWORKS 


The  neural  networks  developed  in  this  study  use  temperature  "measurements"  generated 
by  ANSYS  as  inputs  and  output  estimations  of  the  x  and  y  components  of  the  tool  displacement. 
See  Figure  #3  for  a  functional  representation  of  the  neural  networks. 


Tool  Location 
Displacement 
Components 


dx 


"►dy 


FIGURE  #  3:  Black-box  Representation  of  Neural  Network 
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A  modified  backpropagation  technique  is  used  to  train  the  neural  networks  so  that  the 
output  estimation  of  tool  displacement  is  sufficiently  close  to  the  "actual"  values  generated  by 
ANSYS.  If  a  neural  network  is  unable  to  sufficiently  estimate  the  tool  displacement  after  a  certain 
amount  training,  the  network  structure  is  modified  and  training  is  repeated.  Evaluation  of  each 
neural  network  performance  is  based  on  a  comparison  between  the  tool  displacement  results  of 
ANSYS  and  the  neural  network. 

Data  Preprocessina 

From  the  thermal  solution,  a  temperature  history  of  selected  nodes  can  be  extracted  by 
ANSYS  to  be  used  as  temperature  "measurement"  inputs  to  the  neural  networks.  Therefore,  it 
must  be  determined  which  nodes  to  use  for  these  temperature  "measurements".  Nodes  on  the 
exterior  surfaces  of  the  model  are  a  practical  choice  because  here  is  where  it  would  be  easiest  to 
mount  temperature  measuring  devices  such  thermocouples.  In  addition,  it  was  found  that  thermal 
displacements  in  the  x-direction  were  larger  than  in  the  y-direction.  So  more  nodes  are  chosen  on 
the  horizontal  surfaces  than  on  the  vertical  surfaces.  Figure  #4  shows  which  nodes  are  for  the 
selected  temperature  "measurement"  locations.  There  are  a  total  of  16  nodes  selected. 


FIGURE  #  4:  Nodes  Selected  For  Temperature  "Measurements” 


The  temperature  "measurements"  must  be  taken  at  a  selected  sampling  period.  For  the 
temperature  data  to  correspond  with  the  tool  trajectory  data,  a  "measurement"  is  taken  every  2 
minutes  for  the  first  40  minutes  and  every  4  minutes  thereafter.  This  sampling  method  was  found 
to  sufficiently  represent  the  dynamics  of  all  types  of  thermal  expansion. 

Now,  the  temperature  and  displacement  data  must  be  normalized  before  it  can  be  used 
with  a  neural  network.  The  type  of  neural  network;  used  in  this  study  accepts  inputs  in  the  range 
of -1 .0  to  +1.0  and  during  training,  the  desired  or  actual  outputs  must  be  in  the  range  of  0.0  to 
+1.0.  Therefore,  to  avoid  errors  associated  with  ne^l  network  operation  at  the  endpoints  of 
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these  ranges,  the  temperature  data  is  normalized  to  be  between  -0.9  and  +0.9  and  the 
displacement  data  is  normalized  to  be  between  +0. 1  and  +0.9.  Equations  1  and  2  are  used  to 
normalize  the  temperature  and  displacement  data,  respectively. 


Tn=  0.9* 


T-(Ta 


Tmax 


Tmax-^n- 


(1). 


<ftn  =  0.5  +  0.8* 


dx-idx^ 


dxxMxrdx- 


dxmla-iJx„ 


and 


dy„  =  0.5  +  0.8  * 


dymax~<fy„ 


tfyiaaiC-dyn 


(2). 


Where:  T,  dx,  and  dy  are  the  temperature  and  displacement  data, 

Ta,  dx,,,  and  dyn  are  the  normalized  temperature  and  displacement  data, 
Tmin>  domin’  ^  ^Ymm  ««  minimum  temperature  and  displacements,  and 
T^,  dx^  and  dy^  are  maximum  temperature  and  displacements. 


The  neural  network  training  sets  consist  of  the  16  normalized  temperature  "measurement" 
input  features  and  the  2  normalized  desired  output  components.  Each  of  the  16  input  features  and 
the  2  desired  output  vectors  used  in  the  training  set  consist  of  every  other  data  point  of  the  entire 
time  span.  In  other  words,  a  data  point  is  used  for  the  training  set  every  4  minutes  for  the  first  40 
minutes  and  every  8  minutes  thereafter. 

To  test  the  performance  of  each  trained  neural  network,  a  testing  set  is  made  up  of  the  16 
input  features  using  all  data  points  of  the  entire  time  span.  Therefore  during  testing,  the  neural 
networks  essentially  interpolate  between  training  data  points. 

In  combining  the  three  thermal  loading  schemes,  the  following  table  summarizes  the 
approach  taken  for  all  input  and  desired  output  features.  As  before,  every  data  point  is  used  in  the 
testing  set  and  every  other  data  point  is  used  in  the  training  set. 


Time 

0 

0 

0 

1 

1 

1 


Thermal  Load  Case  # 
1 
2 
3 
1 
2 
3 


TABLE  #  2;  Combination  Qf'Oo'ee  Thermal  Load  Cases 
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Neural  Network  Training  Scheme 


The  neural  networks  used  in  this  study  have  the  basic  structure  shown  in  Figure  #5.  The 
structure  is  a  three  layer,  fully  inter-connected  structure  where  each  connection  is  associated  with 
a  weight  value.  The  input  layer  consists  of  16  nodes  corresponding  to  "measurements"  of  the 
machine  temperature  distribution.  The  output  layer  consists  of  2  nodes  corresponding  to  the 
displacement  components  of  the  machine  tool.  Between  the  input  and  output  layers  is  a  hidden 
layer  which  performs  feature  extraction  and  noise  suppression  [2], 


The  first  step  in  training  the  neural  networks  is  gathering  and  preprocessing  sufficient 
input  and  output  data.  Then  the  learning  parameters,  or  weights,  of  the  neural  network  are 
initialised  randomly.  Now,  the  input  temperature  data  is  applied  to  the  network  and  the  outputs 
are  compared  to  the  known  tool  displacement  outputs.  The  weights  are  then  adjusted  by  a 
backpropagation  rule  which  essentially  adjusts  the  weights  in  the  direction  of  the  steepest  error 
gradient.  In  normal  backpropagation,  the  weights  are  adjusted  in  this  manner  until  the  system 
output  error  reaches  a  desired  level.  The  system  output  error  is  defined  as 

(kil  +  M)2  (3). 

Where:  E  =  system  output  error, 

n  =  number  of  training  samples, 

e,  =  error  between  known  and  estimated  x-component  of  tool  displacement,  and 
e2  =  error  between  known  and  estimated  y-component  of  tool  displacement. 
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However,  with  normal  backpropagation,  neural  network  training  might  lead  to  a  relative 
or  local  error  minimum  instead  of  the  absolute  or  global  minimum.  The  modified  technique  used 
in  this  study,  is  intended  to  eliminate  this  problem.  As  in  normal  backpropagation,  weights  of  the 
neural  network  are  initialized  randomly,  the  input  temperature  data  is  applied  to  the  network  and 
the  outputs  are  compared  to  the  known  tool  displacement  outputs.  The  weights  are  adjusted  by  a 
backpropagation  rule  for  100  iterations.  The  final  weights,  system  error,  and  convergence  rate 
are  saved.  The  weights  are  then  reinitialized  and  adjusted  for  100  more  iterations.  This  process  is 
repeated  100  times.  When  finished,  the  training  scheme  uses  the  weights  from  the  best  network 
and  continues  with  normal  backpropagation  until  the  desired  error  level  is  reached.  The  best 
solution  is  determined  by  a  linear  combination  of  the  final  system  error  and  the  error  convergence 
rate. 

Testing  and  Evaluation  of  Neural  Networks 

Trained  neural  networks  are  tested  by  applying  the  testing  set  which  consists  of  the  16 
input  temperature  "measurements"  over  the  entire  time  span.  Evaluation  of  individual  neural 
network  performance  is  based  on  the  percentage  error  of  estimation  to  actual  tool  displacement 
for  both  x  and  y  directions.  A  comparison  is  made  to  determine  which  case  of  thermal  expansion 
is  most  difficult  for  a  neural  network  to  estimate.  This  is  done  based  on  the  system  output  errors 
and  the  time-dependant  percentage  errors.  For  comparison  validity,  neural  networks  of  all 
thermal  expansion  cases  were  trained  for  the  same  amount  of  time. 

ANALYSIS  OF  FIRST  THERMAL  LOADING 


Time-Vflrient  Temperature  Profile 

The  temperature  profile  generated  by  ANSYS  of  the  machine  body  with  a  constant  heat 
flux  applied  to  one  side  of  the  machine  is  shown  in  Figure  #6.  This  profile  is  generated  at  the  end 
of  the  time  span,  2  hours.  It  can  be  seen,  that  the  temperature  distribution  is  nearly  constant  in  the 
vertical  direction,  this  implies  most  thermal  expansion  will  occur  in  the  horizontal  direction.  It 
should  also  be  noted  that  the  maximum  temperature  change  occurring  in  the  machine  body  is  less 
than  2°C  and  the  temperature  gradient  from  the  left-hand  side  to  the  right-hand  side  is  less  than 
1°C. 
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FIGURE  #  6:  Final  Temperature  Profile  (Thermal  Load  #1) 

Actual  Tool  Location  Displacement 

The  tool  displacement  results  generated  by  ANSYS  for  a  uniform  heat  flux  applied  to  the 
left  side  of  the  machine  are  shown  in  Figure  #7.  The  tool  moves  approximately  7pm  in  the 
x-direction  and  3pm  in  the  y-direction.  Also,  there  is  a  hinge  effect  that  takes  place  in  the  first  20 
minutes,  then  the  displacement  becomes  linear. 


ANSYS  Tool  Displacement  Results 


Time  (min) 

FIGURE  #  7:  Actual  Tool  placement  (Thermal  Load  #1) 
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Performance  of  Neural  Network 


A  3  layer  neural  network  with  10  nodes  in  the  hidden  layer  was  trained  to  produce  the 
error  shown  in  Figure  #8.  The  estimation  results  for  this  thermal  loading  are  within  3%  of  the 
tool  displacement  determined  by  ANSYS.  The  percentage  errors  are  larger  in  the  early  stages  of 
expansion  which  may  be  due  to  extremely  small  temperature  gradients.  These  small  gradients  are 
more  susceptible  to  errors  such  as  round-off  errors.  However,  after  approximately  30  minutes  the 
percentage  error  converges  to  zero  . 


NEURAL  NET  DISPLACEMENT  ESTIMATION 

Percentage  Error  of  Ansys  Results 


Time  (min) 


— X-direction  — Y-direction 


FIGURE  #  8:  Neural  Network  Performance  (Thermal  Load  #1) 


ANALYSIS  OF  SECOND  THERMAL  LOADING 
Time-Varient  Temperature  Profile 

The  temperature  profile  of  the  machine  body  with  a  constant  heat  flux  applied  to  all 
exposed  sides  of  the  machine  is  shown  in  Figure  #9.  This  profile  is  generated  at  the  end  of  the 
time  span,  2  hours.  As  expected,  the  machine  is  coolest  at  the  bottom  insulated  surface  and 
warmest  near  the  exposed  comers.  The  maximum  temperature  change  occurring  in  the  machine 
body  over  the  time  span  just  over  2°C  and  the  temperature  gradient  from  warmest  to  coolest  after 
2  hours  is  less  than  1°C. 
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FIGURE  #9:  Final  Temperature  Profile  (Thermal  Load  #2) 


Actual  Tool  Location  Displacement 

The  tool  displacement  results  generated  by  ANSYS  for  a  uniform  heat  flux  applied  to  all 
sides  of  the  machine  are  shown  in  Figure  #10.  The  tool  moves  approximately  9.5pm  in  the 
x-direction  and  6.5pm  in  the  y-direction.  Both  displacement  components  become  mostly  linear 
after  approximately  10  minutes. 


ANSYS  Tool  Displacement  Results 


FIGURE  #  10;  Actual  Tool  displacement  (Thermal  Load  #2) 
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Performance  of  Neural  Network 

A  3  layer  neural  network  with  10  nodes  in  the  hidden  layer  was  trained  to  produce  the 
error  for  the  second  thermal  loading  shown  in  Figure  #1 1 .  This  neural  network  is  similar  to  the 
network  trained  for  the  first  thermal  loading  except  the  weights  are  different.  The  estimation 
results  for  this  thermal  loading  are  within  5%  of  the  tool  displacement  determined  by  ANSYS. 
The  percentage  errors  are  again  larger  in  the  early  stages  of  expansion  which  may  be  due 
round-off  errors.  After  approximately  40  minutes  the  percentage  error  converges  nicely  to  zero. 


NEURAL  NET  DISPLACEMENT  ESTIMATION 

Percentage  Error  of  Ansys  Results 


X-direction  — Y-direcGon 


FIGURE  #  11;  Neural  Network  Performance  (Thermal  Load  #2) 


ANALYSIS  OF  THIRD  THERMAL  LOADING 
Time-Varient  Temperature  Profile 

The  machine  body  temperature  profile  resulting  from  heat  generation  in  the  center  of  the 
machine  and  a  point  heat  source  due  to  tool  heating  is  shown  in  Figure  #12.  This  profile  shows 
the  state  of  the  machine  at  the  end  of  2  hours.  As  expected,  the  machine  is  coolest  at  the  bottom 
insulated  surface  and  wannest  in  the  center  near  the  heat  generation  source.  The  point  heat 
source  applied  at  the  tool  location  does  not  seem  to  affect  the  machine  body.  However,  it  is 
possible  that  the  strength  of  the  heat  source  used  is  not  realistic.  The  maximum  temperature 
change  occurring  in  the  machine  body  over  the  time  span  less  than  3°C  and  the  temperature 
gradient  from  warmest  to  coolest  after  2  hours  is  less  than  1.5°C. 
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Actual  Tool  Location  Displacement 


Figure  #13  shows  the  tool  displacement  trajectory  generated  by  ANSYS  for  die  third 
thermal  loading.  It  can  be  seen  that  the  tool  moves  approximately  9.5pm  in  the  x-direction  and 
8pm  in  the  y-direction.  Again,  both  displacement  components  become  mostly  linear  after  a  short 
period  of  time,  approximately  10  minutes. 


ANSYS  Tool  Displacement  Results 
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FIGURE  #  13:  Actual  Tool  Displacement  (Thermal  Load  #3) 
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Performance  of  Neural  Network 


The  neural  network  displacement  estimation  performance  for  the  third  thermal  loading  is 
shown  in  Figure  #14.  The  neural  network  used  consists  of  3  layers  with  10  nodes  in  the  hidden 
laye,  similar  to  the  networks  trained  for  the  first  and  second  thermal  loadings.  Figure  #14  shows 
the  estimation  results  for  this  thermal  loading  are  within  3%  of  the  tool  displacement  determined 
by  ANSYS.  This  percentage  error  is  a  bit  larger  during  the  first  hour  than  those  of  the  first  and 
second  thermal  loadings,  indicating  the  relationship  between  the  temperature  "measurements"  and 
the  machine  tool  displacement  is  not  quite  as  good  during  this  time.  Possibly  a  longer  training 
time  for  the  neural  network  is  needed  to  better  fit  the  features  of  this  relationship. 


NEURAL  NET  DISPLACEMENT  ESTIMATION 

Percentage  Error  of  Ansys  Results 


— X-direction  — Y-direcfon 


FIGURE  #  14:  Neural  Network  Performance  (Thermal  Load  #3) 


ANALYSIS  OF  COMBINED  THERMAL  LOADING 


Performance  of  Neural  Network 

The  performance  of  the  neural  network  encompassing  all  three  thermal  load  cases  is 
shown  in  Figure  #15.  The  percentage  error  of  the  estimated  displacement  to  the  displacement 
generated  by  ANSYS  is  well  under  1.5%  for  the  entire  time  span.  The  method  of  combining  the 
three  sets  of  data  is  probably  responsible  for  the  high  frequency  oscillation  seen  in  the  percentage 
error  plot.  A  3  layer  neural  network  with  20  nodes  in  the  hidden  layer  is  used  to  generate  these 
results.  It  is  unexpected  that  such  a  small  network  will  perform  so  well  for  a  fairly  general  set  of 
conditions. 
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NEURAL  NET  DISPLACEMENT  ESTIMATION 

Percentage  Error  of  Ansys  Results 
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FIGURE  #  15:  Neural  Network  Performance  (Combined  Thermal  Load) 


DISCUSSION  OF  RESULTS 


The  machine  body  temperature  gradients  generated  in  this  study  are  less  half  of  the  4°C 
gradient  found  by  Chen  in  his  examination  of  thermal  expansion  of  a  CNC  machine  [2],  Chen's 
experiment  was  run  for  7  hours  while  this  study  examines  a  2  hour  time  span.  This  implies  the 
thermal  expansions  used  in  this  study  are  reasonable  estimations  of  what  may  be  expected  in 
actual  applications. 

Neural  network  performance  for  all  three  thermal  load  cases  and  the  combined  thermal 
load  case  is  summarized  in  Table  #3.  It  can  be  seen  that  the  neural  network  structure  used  trained 


Thermal 

Neural  Network 

Training 

System  Output 

Max.  Percent 

Load  Case 

Structure 

Time 

(iterations) 

Error 

Error 

(%) 

#1 

1  hidden  layer  with 

10  nodes 

60,000 

0.000011 

2.684 

#2 

1  hidden  layer  with 

10  nodes 

60,000 

0.000043 

4.661 

#3 

1  hidden  layer  with 

10  nodes 

60,000 

0.000031 

2.415 

Combined 

1  hidden  layer  with 

20  nodes 

60,000 

0.000042 

1.235 

TABLE  #  3:  Comparison  o#Niftlral  Network  Performance 
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better  on  the  first  and  third  thermal  loadings  than  the  second.  This  may  be  due  to  the  fact  that 
there  is  a  total  of  5  separate  loads  applied  to  the  perimeter  of  the  machine.  The  first  thermal  load 
case  only  has  1  load  applied  and  the  second  load  case  has  2  loads  applied.  The  system  output 
error  of  the  first  thermal  loading  is  significantly  lower  than  the  third  thermal  loading  due  to  the 
fact  the  error  did  not  converge  to  zero  as  quickly  in  the  third  thermal  load  case.  This  effect  is 
probably  due  to  the  fact  that  the  temperature  "measurements"  were  all  taken  on  the  perimeter  of 
the  machine  body  and  could  not  represent  the  internal  heat  generation  expansion  as  efficiently  as 
the  expansion  due  to  a  heat  flux  applied  to  an  outside  surface.  This  theory  is  corroborated  by  the 
fast  convergence  of  the  second  load  case  which  also  has  only  thermal  loads  applied  to  the  exterior 
of  the  machine. 

The  comprehensive  neural  network  developed  in  this  study  performed  exceptionally  well. 
It  has  only  double  the  nodes  of  the  individual  neural  networks  with  an  maximum  percentage  error 
of  only  1.2%.  The  method  of  combining  the  three  sets  of  data  is  probably  responsible  for  small 
high  frequency  oscillations  in  the  percentage  error.  As  a  result,  the  system  output  error  is  a  bit 
higher  than  those  of  the  individual  thermal  load  neural  networks.  All  neural  networks  exhibit  a 
low  frequency  oscillatory  nature  in  the  percentage  error.  It  is  not  known  at  this  time  what  causes 
this  effect. 

It  should  also  be  noted  that  all  networks  used  in  this  study  are  fairly  small  neural 
networks.  As  a  rule  of  thumb,  the  number  of  nodes  in  the  hidden  layer  should  at  least  double  the 
number  of  inputs  to  the  neural  network.  Then  for  this  system,  a  reasonable  size  network  consists 
of  approximately  32  nodes  in  the  hidden  layer.  Each  of  these  networks,  even  the  comprehensive 
network  is  significantly  smaller  than  this  rule  of  thumb.  This  would  imply  there  is  quite  a  bit  of 
room  to  add  complexity  to  the  problem  without  generating  an  extremely  large  neural  network. 


CONCLUSIONS 


Due  to  the  exceptional  performance  exhibited  by  the  comprehensive  neural  network,  it  is 
determined  that  neural  network  estimation  of  thermal  expansion  of  a  machine  body  is  feasible  at 
this  level  of  complexity.  In  addition,  it  was  found  that  a  relatively  small  neural  network  can 
estimate  the  thermal  displacement  of  the  machine  tool  with  less  than  a  1.5%  error.  Of  the  three 
types  of  thermal  loading  investigated,  thermal  expansion  due  to  a  constant  heat  flux  applied  to  all 
exposed  surfaces  was  the  most  difficult  for  a  neural  network  to  estimate.  This  is  probably  due  to 
the  feet  that  this  type  of  loading  consists  of  the  most  individual  thermal  loads.  It  was  also  found 
that  in  the  early  stages  of  thermal  expansion  due  to  internal  heat  generation,  it  is  slightly  more 
difficult  for  a  neural  network  to  predict  tool  displacement  using  temperature  "measurements" 
taken  from  the  exterior  of  the  machine  body.  Finally,  all  neural  networks  used  in  this  study  exhibit 
a  low  frequency  oscillatory  nature  in  percentage  error.  It  is  not  known  at  this  time  what  causes 
this  effect. 
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RECOMMENDATIONS 


Further  work  in  this  area  of  study  should  include  an  optimization  of  temperature  sensor 
location  for  satisfactory  neural  network  estimation.  Also,  the  effects  of  temperature  measurement 
noise  or  error  should  be  determined.  This  could  include  adding  a  Gaussian  white  noise  or  other 
type  of  noise  to  the  temperature  results  obtained  from  ANSYS  and  evaluating  the  effect  on  neural 
network  performance. 
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A  ROBUST  APPROACH  TO  REAL-TIME 
TOOL  WEAR  ESTIMATION 


R.  Colbaugh  K.  Glass 

Department  of  Mechanical  Engineering 
New  Mexico  State  University,  Las  Cruces,  NM  88003 

Abstract 

This  paper  presents  a  robust  strategy  for  estimating  tool  wear  in  metal  cutting  opera¬ 
tions.  The  proposed  estimation  algorithm  consists  of  two  components:  a  recurrent  neural 
network  to  model  the  tool  wear  dynamics,  and  a  robust  observer  to  estimate  the  tool  wear 
from  this  model  using  measurements  of  cutting  force.  It  is  shown  that  the  algorithm  en¬ 
sures  that  the  tool  wear  estimation  error  is  uniformly  bounded  in  the  presence  of  bounded 
unmodeled  effects,  and  that  the  ultimate  bound  on  this  error  can  be  made  as  small  as  de¬ 
sired.  The  proposed  approach  is  applied  to  the  problem  of  estimating  tool  wear  in  turning 
and  is  shown  to  provide  wear  estimates  which  are  in  close  agreement  with  experimental 
results. 

1.  Introduction 

There  is  considerable  interest  in  increasing  the  level  of  automation  in  manufacturing 
processes.  Included  among  the  anticipated  benefits  of  this  automation  are  enhanced  pro¬ 
ductivity,  improved  product  quality,  and  reduced  manufacturing  costs.  However,  progress 
toward  the.  development  of  automated  manufacturing  systems  has  been  hindered  by  the 
lack  of  dependable  machine  tool  diagnostic  schemes.  In  particular,  the  problem  of  reliably 
measuring  the  tool  wear  has  been  a  significant  obstacle  to  the  realization  of  effective  auto¬ 
mated  machining  systems.  The  approaches  proposed  thusfar  for  measuring  tool  wear  can 
be  divided  into  two  broad  classes:  direct  methods  and  indirect  methods.  In  the  direct  ap¬ 
proach,  tool  wear  is  measured  directly  using,  for  example,  microscopes  or  computer  vision 
technology.  While  this  method  can  provide  very  accurate  wear  information,  it  is  inherently 
an  off-line  approach  and  is  therefore  limited  in  its  applicability.  Alternatively,  the  indirect 
approach  to  tool  wear  measurement  is  to  estimate  the  wear  state  from  measurable  cutting 
variables,  such  as  cutting  force  or  acoustic  emission.  Observe  that  the  indirect  method  can 
be  used  for  real-time  tool  wear  determination  and  consequently  has  considerable  poten¬ 
tial  for  use  in  automated  machining  systems.  However,  the  accuracy  of  existing  indirect 
techniques  is  often  unsatisfactory.  2  - 165 


1 


Recognizing  the  importance  of  reliably  estimating  the  machine  tool  wear  state  in  real¬ 
time,  several  researchers  have  studied  this  problem  in  recent  years  [e.g.,  1-10].  Early 
work  focused  on  the  application  of  statistical  methods  to  associate  patterns  in  measurable 
signals  with  wear  states  [e.g.,  1,2].  More  recently,  it  has  been  proposed  that  this  pattern 
recognition  could  be  accomplished  effectively  using  neural  networks  [3-5].  A  different 
approach  to  tool  wear  estimation  is  to  acknowledge  that  tool  wear  evolves  as  a  dynamical 
system  during  the  cutting  process,  and  to  apply  system  theoretic  ideas  to  the  estimation  of 
wear  state  [e.g.,  6-10].  The  principle  difficulty  associated  with  this  strategy  is  the  nonlinear 
nature  of  the  process  model  and  the  lack  of  good  estimates  for  the  model  parameter  values, 
so  that  traditional  state  observation  techniques  are  difficult  to  apply.  Nevertheless,  this 
method  has  considerable  potential  because  the  formulation  is  compatible  with  the  dynamic 
character  of  the  tool  wear  process. 

This  paper  presents  a  new  strategy  for  the  real-time  estimation  of  tool  wear  in  metal 
cutting  operations.  The  proposed  estimation  algorithm  consists  of  two  components:  a 
recurrent  neural  network  to  model  the  tool  wear  dynamics,  and  a  robust  observer  to 
estimate  the  tool  wear  from  this  model  using  measurements  of  cutting  force.  Thus  the 
approach  is  to  train  the  neural  network  so  that  it  provides  a  good  model  for  the  nonlinear 
and  uncertain  tool  wear  dynamical  system,  and  then  to  design  the  observer  to  ensure  good 
performance  of  the  overall  system  in  the  presence  of  (inevitable)  modeling  error.  A  stability 
analysis  is  conducted  to  show  that  tool  wear  estimation  error  is  uniformly  bounded  with 
an  ultimate  bound  that  can  be  reduced  arbitrarily.  The  efficacy  of  the  proposed  approach 
is  demonstrated  through  a  case  study  involving  the  estimation  of  tool  wear  in  a  turning 
application;  it  is  shown  in  this  study  that  the  algorithm  provides  wear  estimates  which  are 
in  close  agreement  with  experimental  results. 

2.  Preliminaries 

This  paper  considers  the  problem  of  estimating  in  real-time  the  tool  wear  state  during 
machining  operations.  Numerous  researchers  have  studied  the  cutting  process  and  pro¬ 
posed  models  for  tool  -wear.  For  example,  Park  and  Ulsoy  [9]  have  proposed  that  flanl- 
wear  dominates  the  tool  wear  process  in  many  important  applications,  and  have  proposed 
the  following  model  for  this  component  of  tool  wear: 


(1) 


«7i  =  -hvwfi  +  k2  — 

Jd 

Wf2  =  &3\/^exp(-^) 

F  —  k$fnd  +  +  k%d{wjx  +  t 0/2) 
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where  Wf. ,  and  Wf2  are  the  flank  wear  components  due  to  abrasion  and  diffusion,  respec¬ 
tively,  Wf  =  wfl  +Wf2  is  the  total  flank  wear,  /,  v.  d  are  the  feed,  cutting  speed,  and  depth 
of  cut,  respectively,  and  are  the  inputs  for  the  system,  and  F  is  the  cutting  force  and  is 
the  output  (measurable  variable)  for  the  system.  In  (1),  n  and  the  kt  are  constants  which 
depend  on  the  effective  rake  angle,  workpiece  material,  and  other  task-related  parameters, 
and  T  is  the  tool- work  temperature  and  depends  on  the  inputs  /,  v  and  the  state  ,  «?/  . 
Observe  that  the  tool  wear  model  (1)  is  a  complicated  and  highly  uncertain  dynamical 
system.  For  example,  the  model  for  tool-work  temperature  is  purely  an  empirical  one,  and 
only  crude  estimates  are  available  for  the  values  of  most  of  the  parameters  which  appear 
l11  (1)-  Indeed,  although  the  model  (1)  has  shown  reasonable  agreement  with  experimental 
results  in  particular  cases,  there  is  no  general  agreement  in  the  machining  literature  as  to 
an  appropriate  model  for  the  tool  wear  process. 

In  view  of  these  facts,  we  have  decided  to  use  only  the  basic  structural  features  of  the 
tool  wear  model  (1)  in  our  approach,  and  to  assume  that  additional  details  concerning  the 
model  are  not  available  at  present.  More  specifically,  consider  the  following  general  model 
for  the  tool  wear  dynamics: 

x  =  f(x,  u)  (2a) 

F  =  aru  +  bTx  (26) 

where  x  F'fi  is  the  tool  wear  state,  u  £  3t’rr<  is  the  vector  of  (reparame¬ 

terized)  system  inputs  such  as  feed,  speed,  and  depth  of  cut,  F  is  again  the  cutting  force 
which  is  assumed  to  be  measurable,  f  :  0?2  x  quantifies  the  tool  wear  dynamics 

and  is  assumed  to  be  unknown,  and  a  £  b  =  [b  b]T  £  3?2  are  vectors  of  unknown 
constants  which  quantify  the  output  model.  Note  that  the  reparameterization  of  the  inputs 
provides  an  affine  (but  unknown)  relationship  between  the  new  inputs  u  and  the  output 
F .  Observe  that  the  model  (2)  incorporates  only  the  basic  qualitative  features  of  the  tool 
wear  model  (1)  and  does  not  assume  that  any  quantitative  model  information  is  available. 

The  objective  of  this  paper  is  to  develop  an  algorithm  which,  given  knowledge  of 
the  system  inputs  u  and  measurements  of  the  system  output  F,  can  provide  an  accurate 
estimate  of  the  system  state  x.  The  proposed  solution  to  this  problem  is  to  utilize  a 
recurrent  neural  network  to  model  the  dynamical  system  (2)  and  a  robust  observer  to 
estimate  the  tool  wear  state  from  this  model  using  knowledge  of  u  and  F.  The  feasibility 
of  this  approach  depends  on  the  possibility  of  modeling  (2)  with  a  neural  network;  thus  it 
will  be  helpful  to  establish  a  few  facts  concerning  the  approximation  of  dynamical  systems 
using  a  recurrent  neural  network  (RNN).  Consider  the  RNN 

xn  =  -  ¥ Vz(xn ,  u)  (3 ) 
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where  xn  €  3?”  is  the  state  of  the  network,  A  =  diag[a,-]  €  3£nXn  with  the  a,-  positive 
constants,  W  £  dtnxp  is  the  matrix  of  adjustable  weights  for  the  network,  and  z  :  3?”  x 
is  a  function  of  the  network  states  and  inputs.  The  accuracy  with  which  a  RNN  of 
the  form  (3)  can  approximate  a  general  dynamical  system  such  as  (2a)  depends  crucially  on 
the  values  used  for  the  elements  of  W  and  on  the  functional  form  of  z.  Several  methods  for 
selecting  the  form  of  z  have  been  proposed  in  the  neural  network  literature,  and  extensive 
theoretical,  simulation  and  experimental  studies  have  demonstrated  the  effectiveness  of 
these  methods.  In  what  follows,  we  shall  assume  that  z  is  constructed  using  either  recurrent 
higher  order  neural  network  (RHONN)  methods  [e.g.,  11]  or  Gaussian  radial  basis  function 
network  (GRBFN)  techniques  [e.g.,  12].  The  suitability  of  using  either  of  these  choices  for 
z  is  indicated  in  the  following  lemma. 

Lemma  1:  Suppose  that  the  RNN  (3)  represents  either  a  RHONN  or  a  GRBFN  (de¬ 
pending  on  the  choice  of  z)  and  that  the  RNN  (3)  and  the  dynamical  system  (2a)  are 
initially  at  the  same  state  (i.e.,  x(0)  =  x„(0)).  Then  for  any  e  >  0  and  any  T  >  0  there 
exists  an  integer  p  and  a  matrix  W*  such  that  the  network  (3)  with  W  =  W*  satisfies 
||  x(t)  —  xn(t)  ||  <  e  for  all  t  £  [0,T]. 

Proof:  The  proof  is  based  on  the  Stone- Weierstrass  theorem,  and  is  given  for  RHONN 
in  [11]  and  can  be  obtained  for  GRBFN  through  a  straightforward  generalization  of  the 
results  in  [11].  D 

This  result  shows  that,  if  a  sufficient  number  of  interconnections  are  permitted  in  the 
network,  the  dynamical  system  (2a)  can  be  modeled  with  arbitrary7'  accuracy  using  either 
RHONN  or  GRBFN  models.  Observe  that  this  is  an  existence  result,  however,  and  that  it 
does  not  prescribe  a  method  for  obtaining  a  weighting  matrix  W*  which  ensures  that  the 
desired  accuracy  is  obtained.  This  latter  problem  is  considered  in  the  following  section, 
where  the  proposed  tool  wear  estimation  algorithm  is  developed. 

Finally,  we  note  that  our  approach  to  the  design  and  analysis  of  both  the  RNN  and 
observer  components  of  the  tool  wear  estimation  algorithm  is  based  on  Lyapunov  stabil¬ 
ity  theory.  The  following  lemma  establishes  a  useful  result  concerning  a  certain  class  of 
Lyapunov  functions  and  will  be  of  direct  relevance  in  this  development. 

Lemma  2:  Consider  the  coupled  dynamical  system  Xj  =  fi(x1,x2,t),  Xo  =  f2(x3 . x2 . t). 
Let  y(xj,x2,t)  be  a  Lyapunov  function  for  the  system  with  the  properties 

Aa  ||  Xl  ||2  +A2  ||  x2  ||2<  V  <  A3  ||  Xl  ||2  +A4  ||  x2  ||2 
V  <  — A5  ||  xj  ||2  -A6  ||  x2  ||2  +e 

where  e  and  the  A,-  axe  positive  scalar  c&cJf&its.  Define  8  =  max(A3/A5,  A4/A6)  and 
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ri  —  (t>e/ A,-)  7  for  i  =  1,  2.  Then  for  any  initial  state  x1(0),x2(0)  the  system  will  evolve  so 
that  x2(t)  are  uniformly  bounded  and  converge  exponentially  to  the  closed  balls  Br, . 

BT2,  respectively,  where  Br{  =  {xt-  :  ||  Xf-  ||<  r,-}  (see  [13]  for  a  discussion  of  exponential 
convergence  to  a  closed  ball). 

Proof:  The  proof  is  based  on  an  extension  of  the  global  exponential  convergence  theorem 
of  Corless  [13]  and  is  given  in  [14].  a 

3.  Tool  Wear  Estimation  Algorithm 

The  proposed  tool  wear  estimation  algorithm  consists  of  two  components:  a  RNN  and 
a  robust  observer.  In  what  follows,  each  of  these  subsystems  is  described  and  analyzed  in 
detail.  The  performance  of  the  complete  system  is  then  illustrated  through  a  case  study 
in  Section  4. 

3.1  Neural  Network  Model  for  Tool  Wear  Dynamics 

Consider  the  problem  of  approximating  the  dynamical  system  (with  outputs)  given  in 
(2)  using  a  RNN.  First  note  that  modeling  the  output  equation  (2b)  requires  only  that 
the  unknown  constant  vectors  a,  b  be  determined.  Since  (2b)  is  linear  in  these  unknown 
parameters,  there  are  many  techniques  available  for  estimating  them  and  thus  for  approx¬ 
imating  (2b).  For  example,  suppose  that  a  €  &m  and  b  €  ft2  are  estimates  for  a  and 
b,  and  that  these  estimates  are  computed  adaptively  by  comparing  the  evolution  of  the 
output  F(t)  of  the  actual  system  (2)  with  the  “predicted”  output  F(t)  =  au  +  bx.  One 
means  of  calculating  these  estimates  based  on  the  discrepancy  between  F(t)  and  F(t)  is 
to  utilize  the  following  adaptive  identification  scheme: 

S  =  7(F  -  F)  u 

b  =  7 (F  -  F)x  (4) 

where  7  is  a  positive  scalar  constant.  The  suitability  of  this  adaptive  approach  to  estimating 
a,  b  is  indicated  in  the  following  lemma. 

Lemma  3:  Suppose  that  the  input  u(t)  to  the  dynamical  system  (2)  is  chosen  so  that 

/t+T 

y(T)yT(r)dT  >  aIm+2 

for  all  f  >  0,  where  y  =  [uT  xT}T  €  £m+2,  Im+2  is  the  (m  +  2)  x  (m  +  2)  identity  matrix, 
and  T,  a  are  positive  constants.  Then  the  estimation  scheme  (4)  ensures  that  a  — *  a  and 
b  b  as  t  — >  00  for  any  initial  a(0)  and  &(D^9 
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Proof:  Let  c  =  [a7  b’’]’’  and  c  =  [S’’  b1’)’’,  so  that  F  =  cTy  and  F  =  cry.  Consider 
the  Lyapunov  function  candidate  V,  =  dTd/2,  where  *  =  c  -  c.  Differentiating  V,  along 
the  trajectories  of  (4)  yields,  after  simplification 

Vi  =  — T  dT  (F  —  F)  y 
=  ~7<t>TyyT<l> 

Standard  arguments  from  differential  equation  stability  theoiy  then  show  that,  if  the  con¬ 
dition  on  u  given  in  the  statement  of  the  lemma  is  satisfied,  then  <f>  -»  0  as  t  -*  oo  and 
thus  the  desired  estimates  for  a,  b  are  obtained  [15].  D 

Using  the  preceding  result,  it  is  straightforward  to  obtain  accurate  estimates  for  a  and 
b;  for  example,  it  is  relatively  easy  to  ensure  that  the  condition  on  y  stated  in  Lemma  3  is 
satisfied  during  a  “training”  period  for  the  algorithm.  Thus  it  what  follows  it  is  assumed 
that  a  and  b  have  been  identified  in  this  manner,  so  that  the  problem  of  modeling  the 
dynamical  system  (2)  using  a  RNN  reduces  to  the  problem  of  approximating  (2a)  using 
such  a  network.  This  approximation  process  involves  selecting  the  RNN  weights  W  and 
functions  z  in  such  a  way  that  the  response  of  the  RNN  closely  models  the  response  of  (2a)  if 
each  system  is  subjected  to  the  same  inputs.  Lemma  1  indicates  that  an  appropriate  choice 
for  z  can  be  obtained  using  either  RHONN  methods  or  GRBFN  techniques.  Determining 
suitable  values  for  the  elements  of  W  is  ordinarily  accomplished  by  presenting  a  set  of  input 
trajectories  to  both  the  dynamical  system  to  be  modeled  and  the  RNN,  and  adjusting  the 
weighting  matrix  W  in  such  a  way  that  the  discrepancy  between  the  state  trajectories 
for  the  two  systems  is  reduced;  this  process  is  often  termed  “training”  the  RNN.  In  what 
follows,  we  propose  a  robust  training  procedure  for  the  RNN  given  in  (3). 

It  can  be  seen  from  Lemma  1  that,  if  z  is  chosen  so  that  (3)  is  in  RHONN  or  GRBFN 
(with  sufficient  interconnections  p)  then  there  exists  a  choice  for  W,  say  W*.  for  which 
(3)  provides  a  close  approximation  of  (2a).  In  other  words,  given  this  choice  for  W,  the 
dynamical  system  (2a)  can  be  written  as 

x  =  —Ax  +  W* z(x,  u)  +  u)  (5) 

where  ,  :  &  x  S“  -  S2  quantifies  the  (small)  modeling  error.  The  difficulty,  of  course,  is 
determining  W.  A  robust  approach  for  specifying  W,  and  thereby  training  the  RNN,  is 
proposed  in  the  following  theorem. 

Theorem  1:  Consider  the  following  strategy  for  determining  W: 

x„  =  — -4xn  +  W  z(x,  u) 

W  =  -aWWtyezT 
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(6c) 

(66) 


where  e  =  x  —  xn  and  a ,  [3  axe  positive  scalar  constants.  This  strategy  ensures  that  e  and 
W  =  W*  —  W  axe  uniformly  bounded  for  all  model  mismatches  ??,  and  that  the  ultimate 
bound  on  e  is  small  if  rj  is  small. 

Proof:  The  identification  error  dynamics  can  be  obtained  by  differencing  (5)  and  (6a): 

e  =  —  Ae  +  Wz  +  rj  (7) 

Consider  the  Lyapunov  function  candidate 

v*  =  ieTe  +  ^tr  \WWT]  (8) 

Differentiating  (8)  along  (7)  and  simplifying  yields 

V2  =  -erAe  +  eTi)  +  ifr[  W(W  +  0ezT)T] 

<  -»min  II  e  f  +w  ||  e  ||  ||  W  II2.  +|  ||  W  ||p ||  W  ||p 

where  amin  =  min(at),  rjmax  is  an  upper  bound  on  ||  r\  ||,  and  ||  •  \\F  is  the  Probenius  norm. 
Routine  manipulation  allows  the  following  upper  bound  on  V2  to  be  derived: 

Vi<-^l|e|l2-^||^|l2+^  +  ^^ 

Apphcation  of  Lemma  2  then  permits  the  conclusion  that  e,  W  are  uniformly  bounded  and 
axe  guaranteed  to  converge  exponentially  to  a  compact  neighborhood  of  the  origin  in  e,  IT 
space.  A  bound  on  the  ultimate  size  of  ||  e  ||  can  also  be  obtained  using  Lemma  2:  if  13  is 
chosen  reasonably  large  then  the  size  of  this  ultimate  bound  is  proportional  to  the  size  of 
the  model  mismatch  rjmax,  so  that  e  is  small  if  r/  is  small.  □ 

The  results  summarized  in  Lemma  1,  Lemma  3,  and  Theorem  1  define  a  strategy  for 

modeling  the  uncertain  tool  wear  dynamical  system  (2)  using  a  RNN  together  with  an 

(adaptively  obtained)  output  map.  If  the  conditions  stipulated  in  Lemma  1  and  Theorem 

1  are  satisfied,  the  response  of  the  state  x„  of  the  RNN  to  a  given  input  trajectory  should 

provide  an  accurate  approximation  of  the  response  of  the  actual  system  to  this  input 

trajectory.  Thus  this  RNN  could,  in  principle,  be  used  to  give  an  estimate  for  tool  wear 

during  machining.  However,  it  is  highly  desirable  to  introduce  a  feedback  mechanism  into 

this  estimation  algorithm,  so  that  the  wear  estimate  is  influenced  by  the  evolution  of  the 

actual  system.  One  means  of  accomplishing  this  goal  is  described  in  the  next  section 
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3.2  Robust  Observer  for  Tool  Wear 

Let  Wc  €  3£nX?  denote  the  matrix  of  constants  obtained  by  training  the  RNN  (3) 
using  the  robust  identification  scheme  summarized  in  Theorem  1.  If  the  results  given  in 
Lemma  1,  Lemma  3,  and  Theorem  1  are  used  to  construct  the  RNN  model  of  the  tool 
wear  dynamics  (2),  then  this  dynamical  system  can  be  rewritten  as 

x  =  -Ax  +  Wcz(x,  u)  +  ?/(x,  u)  (9a) 

r*  =  brx  (96) 

where  F*  =  F  —  aru.  In  (9),  r)  is  unknown  but  small  and  A,  Wc,  z,  b,  and  F*  are  known 
(observe,  for  example,  that  F*  is  known  because  F  can  be  measured,  u  is  known  and  a 
has  been  identified). 

Our  objective  is  now  to  develop  an  observer  which  can  asymptotically  reconstruct  the 
tool  wear  state  x  from  measurements  of  F*  despite  the  system  uncertainty  quantified  by  rj. 
Note  first  that  the  original  system  (2)  is  observable  [9],  so  that  this  objective  is  attainable. 
Consider  the  observer 


x0  =  — Ax0  +  Wc z(x0,  u)  +  L(F*  -  bTx0)  (10) 

where  x0  €  3£2  is  the  estimate  of  the  wear  state  x  and  L  €  3£2  xl  is  the  observer  gain  matrix 
to  be  specified  later.  Applying  (10)  to  (9)  yields 

e  =  —{A  +  LbT)e  +  Wc[z(x)  -  z(x0)]  +  77  (11) 

where  e  =  x  —  xQ  is  the  state  estimation  error  and  the  dependence  of  z  on  u  has  been 
supressed  for  clarity.  The  performance  characteristics  of  the  observer  (10)  can  be  deter¬ 
mined  by  studying  the  stability  properties  of  (11).  The  following  fact  concerning  the  pair 
of  matrices  (A,  b)  will  be  helpful  in  this  regard. 

Lemma  4:  If  the  RNN  is  designed  so  that  the  diagonal  elements  of  A  are  not  equal  (this 
is  always  possible)  then  the  eigenvalues  of  (A  +  LhF)  can  be  arbitrarily  assigned  through 
the  specification  of  L. 

Proof:  It  is  trivial  to  verify  that  the  matrix  [bT  Arb]  has  full  rank,  and  the  assignability 
of  the  eigenvalues  of  (A  +  LhT)  follows  immediately  from  this  [e.g.,  16].  a 

The  following  theorem  characterizes  the  performance  capabilities  of  the  observer  (10). 

Theorem  2:  The  observer  (10)  ensures  that  the  state  estimation  error  e  is  uniformly 
bounded,  and  has  an  ultimate  bound  which  can  be  reduced  arbitrarily,  provided  that  L  is 

7  1 77 

properly  specified. 


8 


Proof:  Suppose  that  L  is  chosen  so  that  A  =  A min{A  +  LbT)  =  \max(A  +  LbT)/2  is  a 
large  positive  constant.  Let  A  =  diag(A  2A)  and  M-1  denote  the  associated  (normalized) 
modal  matrix,  so  that  A  =  M(A  +  LbT)M~l.  Then  defining  q  =  Me  permits  (11)  to  be 
rewritten 

q  =  -Aq  +  MWc[z(x)  -  z(x0)]  +  Mr}  (12) 

Consider  the  Lyapunov  frmction  candidate  V3  =  qTq/2.  Differentiating  V3  along  the 
trajectories  of  (12)  yields,  after  simplification 

v>  <  -A  II  q  II2  +  II  M  ||F||  ij  II II  q  II  +  ||  M  ||F||  Wc  ||F||  z(x)  -  z(x„)  ||||  q  || 

<  -(A-  K  ||  M  Up'll  W e  ||F)  ||  q  ||2  +  ||  M  Up’ll  n  llll  q  || 

where  I<  >  0  satisfies  I<  ||  M(x-x0)  ||>||  z(x)-z(x0)  ||  Vx,x0  (the  existence  of  such  a  K 
is  a  consequence  of  the  fact  that,  for  RHONN  and  GRBFN  designs,  the  partial  derivatives 
of  z  axe  all  bounded).  Assume  that  L  is  chosen  so  that  Ao  =  A  —  K  ||  M  ||p||  Wc  ||p 
is  positive  (this  is  always  possible).  Then  routine  manipulation  permits  V3  to  be  upper 
bormded  as 

Fs  <  -(Ao  ||  q  ||  -  W  ||  M  ||P)  ||  q  || 

From  Lemma  2  it  can  now  be  concluded  that  ||  q  ||  is  uniformly  bounded  and  that  the 
ultimate  bound  on  this  error  is  j|  q  ||<  r]max  ||  M  || f  /Ao;  note  that  the  ultimate  bound 
on  II  q  II  (aJid  therefore  ||  e  ||)  can  be  decreased  as  desired  by  increasing  A0  through  the 
specification  of  L.  o 

The  observer  (10)  can  be  modified  to  provide  improved  performance  for  those  cases 
in  which  the  RNN  is  designed  so  that  the  model  mismatch  rj  possesses  some  structure. 
For  example,  suppose  that  the  RNN  and  the  observer  gain  matrix  L  are  selected  so  that 
MT Mr\  —  t]2}t  with  rft  r}2  [17],  In  this  situation,  if 

*  _  1  Vi  +V  2 
2  m  +  V2 . 

*.  _  I  [  Vl  -  V2 
2  »?2  -  Vi 

then  MTMrj  =  tj*  +  rj**  and  rj**  is  very  small.  A  modified  observer  can  then  be  developed 
which  exploits  this  additional  information  on  the  stucture  of  the  model  mismatch;  this 
modification  is  proposed  in  the  following  theorem. 

Theorem  3:  Consider  the  observer 

x0  =  —  Ax0  +  TTcz(x0,  '^^(F*  -  bTx0)  +  s(x0,  F*)  (13a) 
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where  all  terms  are  defined  as  in  (10)  and 


^  0  if  bTe  =  0 

with  p  >  0.  Note  that,  although  e  appears  in  the  definition  for  s  in  (13b),  s  can  be 
computed  using  only  knowledge  of  x0  and  F*  so  that  (13)  is  implementable.  The  observer 
(13)  ensures  that  the  state  estimation  error  e  is  uniformly  bounded  provided  that  L  and  p 
are  properly  specified.  Moreover,  if  r)**  is  negligible  then  e  converges  to  zero  exponentially. 
Proof:  The  observer  (13)  leads  to  the  following  state  estimation  error  dynamics  (in  the 
error  coordinate  q) 

q  =  -Aq  +•  MWC[ z(x)  -  z(x0)]  +  M[(MTM)~1rj*  -  s(x0,  F*)]  (14) 

where  it  is  assumed  that  p**  is  negligible.  Differentiating  the  Lyapunov  function  candidate 
V3  defined  in  Theorem  2  along  the  trajectories  of  (14)  and  simplifying  yields  the  following 
upper  bound  on  V3: 

v,  <  -(A  -  K  II  M  IIHI  wc  ||F)  II  q  ||2  +eV  -  Ib^e^ 

<  -(A  -  K  ||  M  IIHI  Wc  ||F)  ||  q  ||2  +erb,“*  -  lb1’*!/. 

where  77***  =  (r?i  +  172) /2b  and  all  the  other  terms  are  as  defined  in  the  proof  of  Theorem 
2.  Thus  if  p  is  chosen  so  that  p  >  77***  V  x,  u  then 

y3  <  —(A  —  I<  ||  M  ||f  ||  Wc  ||F)  ||  q  ||2  -|bTe|(p  -  77***) 

<  —(A  —  I<  \\M  ||f  ||  Wc  ||f)  ||  q  ||2 

and  the  exponential  convergence  to  the  origin  of  q  (and  therefore  e)  follows  from  Lemma 
2  (with  e  =  0).  □ 

4.  Case  Study:  Turning 

The  proposed  tool  wear  estimation  algorithm  was  evaluated  through  a  case  study 
involving  a  turning  application.  In  this  case  study,  AISI  medium  carbon  steel  cylinders 
having  223  BHN  hardness  were  turned  using  a  CNC  lathe  with  Valenite  VC55  TNMA432 
uncoated  cemented  carbide  tools.  Cutting  force  measurements  were  obtained  for  all  trials, 
and  the  following  cutting  conditions  were  used  throughout  the  case  study:  cutting  speed 
=  200m/min,  feed  =  0.15mm/rev,  and  depth  of  cut  =  1.27mm. 

The  tool  wear  estimation  algorithm  utilized  in  this  case  study  consisted  of  an  RHONN 
of  the  form  given  in  (3)  together  with  the  siaf^bserver  (10).  It  is  noted  that  the  observer 
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(13)  was  also  implemented  for  several  trials  and  provided  results  which  were  indistinguish¬ 
able  from  those  obtained  using  (10),  so  that  only  the  results  using  (10)  are  reported  here. 
The  RHONN  structure  was  determined  using  the  design  procedure  given  in  [11].  The  net¬ 
work  was  trained  using  data  generated  through  computer  simulation  using  the  tool  wear 
model  proposed  in  [10].  In  each  trial  only  two  cutting  experiments  were  simulated,  so 
that  the  training  data  sets  were  quite  modest.  The  parameter  values  used  in  the  observer 
(10)  were  obtained  after  a  few  iterations  using  computer  simulation  data,  and  no  attempt 
was  made  to  “time”  the  parameters  to  achieve  optimal  performance.  Instead,  two  sets 
of  observer  parameters  were  used:  a  “low  gain”  observer  which  resulted  in  a  moderate 
magnitude  for  A,  and  a  “high  gain”  observer  which  placed  the  magnitude  of  A  at  a  large 
value.  All  details  concerning  the  implementation  of  the  proposed  estimation  algorithm, 
including  the  structure  of  the  RHONN  and  the  parameter  values  for  the  network  and  the 
observer,  are  given  in  [18]. 

To  evaluate  the  performance  of  the  “trained”  estimation  algorithm,  the  algorithm  was 
tested  using  the  experimental  data  obtained  by  Park  and  Ulsoy  in  [9].  More  specifically, 
the  force  measurement  trajectories  reported  in  [9]  were  used  as  inputs  to  the  wear  state 
estimation  algorithm  to  generate  estimated  tool  wear  trajectories.  The  estimates  for  the 
flank  wear  components  due  to  abrasion  (u j1 )  and  diffusion  (wf2)  for  typical  trials  with 
the  low  gain  observer  and  high  gain  observer  are  shown  in  Figures  la  and  lb,  respectively. 
Note  that,  although  the  estimates  of  the  individual  components  of  tool  wear  obtained  using 
the  high  gain  observer  are  the  more  reasonable  (the  low  gain  observer  estimates  a  reduction 
in  wear  due  to  diffusion  in  this  trial),  the  estimate  of  total  tool  wear  wj  =  wjx  +  w/2  is 
comparable  in  each  case.  The  evolution  of  the  total  tool  wear  Wf  obtained  using  the  high 
gain  observer  is  shown  in  Figure  2,  and  it  can  be  seen  that  this  estimate  agrees  quite  closely 
with  the  experimental  measurements  made  by  Park  and  Ulsoy  [9]. 

5.  Conclusions 

This  paper  presents  a  robust  strategy  for  estimating  tool  wear  in  metal  cutting  opera¬ 
tions.  The  proposed  estimation  algorithm  consists  of  a  recurrent  neural  network  to  model 
the  tool  wear  dynamics,  and  a  state  observer  to  estimate  the  tool  wear  from  this  model 
using  measurements  of  cutting  force.  A  stability  analysis  is  performed  to  demonstrate 
that  the  algorithm  provides  uniformly  bounded  tool  wear  estimation  error  and  that  the 
ultimate  bound  on  this  error  can  be  reduced  arbitrarily.  The  proposed  approach  is  applied 
to  the  problem  of  estimating  tool  wear  in  turning  and  is  shown  to  provide  wear  estimates 
which  are  in  close  agreement  with  experimental  results.  Future  work  will  focus  on  applying 
the  proposed  approach  of  real-time  state  $slET&ation  to  other  problems  of  importance  in 
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manufacturing. 
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Figure  la:  Estimated  flank  wear  components  (low  gain  observer) 


Figure  lb:  Estimated  flank  wear  coiqpcpgnts  (high  gain  observer) 
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Abstract 

This  paper  presents  a  robust  strategy  for  esti¬ 
mating  tool  wear  in  metal  cutting  operations.  The 
proposed  estimation  algorithm  consists  of  two  compo¬ 
nents:  a  recurrent  neural  network  to  model  the  tool 
wear  dynamics,  and  a  robust  observer  to  estimate  the 
tool  wear  from  this  model  using  measurements  of  cut¬ 
ting  force.  It  is  shown  that  the  algorithm  ensures  that 
the  tool  wear  estimation  error  is  uniformly  bounded 
in  the  presence  of  bounded  unmodeled  effects,  and 
that  the  ultimate  bound  on  this  error  can  be  made 
as  small  as  desired.  The  proposed  approach  is  applied 
to  the  problem  of  estimating  tool  wear  in  turning  and 
is  shown  to  provide  wear  estimates  which  are  in  close 
agreement  with  published  experimental  results. 

1.  Introduction 

There  is  considerable  interest  in  increasing  the 
level  of  automation  in  manufacturing  processes.  In¬ 
cluded  among  the  anticipated  benefits  of  this  au¬ 
tomation  are  enhanced  productivity,  improved  prod¬ 
uct  quality,  and  reduced  manufacturing  costs.  How¬ 
ever,  progress  toward  the  development  of  automated 
manufacturing  systems  has  been  hindered  by  the  lack 
of  dependable  machine  tool  diagnostic  schemes.  In 
particular,  the  problem  of  reliably  measuring  the  tool 
wear  has  been  a  significant  obstacle  to  the  realization 
of  effective  automated  machining  systems.  The  ap¬ 
proaches  proposed  thusfar  for  measuring  tool  wear  can 
be  divided  into  two  broad  classes:  direct  methods  and 
indirect  methods.  In  the  direct  approach,  tool  wear  is 
measured  directly  using,  for  example,  microscopes  or 
computer  vision  technology.  While  this  method  can 
provide  very  accurate  wear  information,  it  is  inherently 
an  off-line  approach  and  is  therefore  limited  in  its  ap¬ 
plicability.  Alternatively,  the  indirect  approach  to  tool 
wear  measurement  is  to  estimate  the  wear  state  from 
measurable  cutting  variables,  such  as  cutting  force  or 
acoustic  emission.  Observe  that  the  indirect  method 
can  be  used  for  real-time  tool  wear  determination  and 
consequently  has  considerable  potential  for  use  in  au¬ 
tomated  machining  systems.  However,  the  accuracy  of 
existing  indirect  techniques  is  often  unsatisfactory. 

Recognizing  the  importance  of  reliably  estimat¬ 
ing  the  machine  tool  wear  state  in  real-time,  several^ 


researchers  have  studied  this  problem  in  recent  years 
[e.g.,  1-10].  Early  work  focused  on  the  application  of 
statistical  methods  to  associate  patterns  in  measurable 
signals  with  wear  states  [e.g.,  1,2].  More  recently,  it 
has  been  proposed  that  this  pattern  recognition  could 
be  accomplished  effectively  using  neural  networks  [3- 
5].  A  different  approach  to  tool  wear  estimation  is 
to  acknowledge  that  tool  wear  evolves  as  a  dynami¬ 
cal  system  during  the  cutting  process,  and  to  apply 
system  theoretic  ideas  to  the  estimation  of  wear  state 
[e.g.,  6-10].  The  principle  difficulty  associated  with 
this  strategy  is  the  nonlinear  nature  of  the  process 
model  and  the  lack  of  good  estimates  for  the  model 
parameter  values,  so  that  traditional  state  observation 
techniques  are  difficult  to  apply.  Nevertheless,  this 
method  has  considerable  potential  because  the  formu¬ 
lation  is  compatible  with  the  dynamic  character  of  the 
tool  wear  process. 

This  paper  presents  a  new  strategy  for  the  real¬ 
time  estimation  of  tool  wear  in  metal  cutting  opera¬ 
tions.  The  proposed  estimation  algorithm  consists  of 
two  components:  a  recurrent  neural  network  to  model 
the  tool  wear  dynamics,  and  a  robust  observer  to  es¬ 
timate  the  tool  wear  from  this  model  using  measure¬ 
ments  of  cutting  force.  Thus  the  approach  is  to  train 
the  neural  network  so  that  it  provides  a  good  model  for 
the  nonlinear  and  uncertain  tool  wear  dynamical  sys¬ 
tem,  and  then  to  design  the  observer  to  ensure  good 
performance  of  the  overall  system  in  the  presence  of 
(inevitable)  modeling  error.  A  stability  analysis  is  con¬ 
ducted  to  show  that  tool  wear  estimation  error  is  uni¬ 
formly  bounded  with  an  ultimate  bound  that  can  be 
reduced  arbitrarily.  The  efficacy  of  the  proposed  ap¬ 
proach  is  demonstrated  through  a  case  study  involving 
the  estimation  of  tool  wear  in  a  turning  application;  it 
is  shown  in  this  study  that  the  algorithm  provides  wear 
estimates  which  are  in  close  agreement  with  published 
experimental  results. 

2.  Preliminaries 

This  paper  considers  the  problem  of  estimating  in 
real-time  the  tool  wear  state  during  machining  opera¬ 
tions.  Numerous  researchers  have  studied  the  cutting 
process  and  proposed  models  for  tool  wear.  For  exam- 
l®ple,  Park  and  Ulsoy  [9]  have  proposed  that  flank  wear 


dominates  the  tool  wear  process  in  many  important 
applications,  and  have  proposed  the  following  model 
for  this  component  of  tool  wear: 


wj1  =  -kiVWfi  4  k2 


F  v 

Id 


wj2  =  k3y/vex p( 


) 


F  =  k$fnd  4  k§vd  4  kyd  4  k$d(wf1  4- tt>y3) 


(i) 


where  wjx  and  Wf2  are  the  flank  wear  components  due 
to  abrasion  and  diffusion,  respectively,  wj  =  +wj2 
is  the  toted  flank  wear,  /,v,d  are  the  feed,  cutting 
speed,  and  depth  of  cut,  respectively,  and  are  the  in¬ 
puts  for  the  system,  and  F  is  the  cutting  force  and 
is  the  output  (measurable  variable)  for  the  system. 
In  (1),  n  and  the  k{  are  constants  which  depend  on 
the  effective  rake  angle,  workpiece  material,  and  other 
task-related  parameters,  and  T  is  the  tool- work  tem¬ 
perature  and  depends  on  the  inputs  /,  v  and  the  state 
if/i, vjf2.  Observe  that  the  tool  wear  model  (1)  is  a 
complicated  and  highly  uncertain  dynamical  system. 
For  example,  the  model  for  tool- work  temperature 
is  purely  an  empirical  one,  and  only  crude  estimates 
axe  available  for  the  values  of  most  of  the  parameters 
which  appear  in  (1).  Indeed,  although  the  model  (1) 
has  shown  reasonable  agreement  with  experimental  re¬ 
sults  in  particular  cases,  there  is  no  general  agreement 
in  the  machining  literature  as  to  an  appropriate  model 
for  the  tool  wear  process. 

In  view  of  these  facts,  we  have  decided  to  use  only 
the  basic  structural  features  of  the  tool  wear  model 
(1)  in  our  approach,  and  to  assume  that  additional  de¬ 
tails  concerning  the  model  are  not  available  at  present. 
More  specifically,  consider  the  following  general  model 
for  the  tool  wear  dynamics: 

x  =  f(x,u)  (2a) 

F  =  aTu  +  bTx  (26) 


where  x  =  [ w jx  wj2]t  E  3ft2  is  the  tool  wear  state, 
u  E  3ftm  is  the  vector  of  (reparameterized)  system  in¬ 
puts  such  as  feed,  speed,  and  depth  of  cut,  F  is  again 
the  cutting  force  which  is  assumed  to  be  measurable, 
f  :  3ft2  x  3ftm  — ►  3ft2  quantifies  the  tool  wear  dynam¬ 
ics  and  is  assumed  to  be  unknown,  and  a  E  9ftm, 
b  =  [6  b]T  E  9ft2  are  vectors  of  unknown  constants 
which  quantify  the  output  model.  Note  that  the  repa¬ 
rameterization  of  the  inputs  provides  an  affine  (but 
unknown)  relationship  between  the  new  inputs  u  and 
the  output  F .  Observe  that  the  model  (2)  incorpo¬ 
rates  only  the  basic  qualitative  features  of  the  tool 
wear  model  (1)  and  does  not  assume  that  any  quanti¬ 
tative  model  information  is  available. 


The  objective  of  this  paper  is  to  develop  an  algo¬ 
rithm  which,  given  knowledge  of  the  system  inputs  u 
and  measurements  of  the  system  output  F,  can  pro¬ 
vide  an  accurate  estimate  of  the  system  state  x.  The 
proposed  solution  to  this  problem  is  to  utilize  a  recur¬ 
rent  neural  network  to  model  the  dynamical  system 
(2)  and  a  robust  observer  to  estimate  the  tool  wear 
state  from  this  model  using  knowledge  of  u  and  F. 
The  feasibility  of  this  approach  depends  on  the  possi¬ 
bility  of  modeling  (2)  with  a  neural  network;  thus  it 
will  be  helpful  to  establish  a  few  facts  concerning  the 
approximation  of  dynamical  systems  using  a  recurrent 
neural  network  (RNN).  Consider  the  RNN 


xn  =  ~Axn  4  Wz(x„ ,  u)  (3) 

where  xn  E  3ft11  is  the  state  of  the  network,  A  = 
diagfa,]  E  3ftnxn  with  the  a*  positive  constants,  W  E 
3ftnxp  is  the  matrix  of  adjustable  weights  for  the  net¬ 
work,  and  z  :  3ftn  x  3ftm  — *  3ftp  is  a  function  of  the 
network  states  and  inputs.  The  accuracy  with  which 
a  RNN  of  the  form  (3)  can  approximate  a  general  dy¬ 
namical  system  such  as  (2a)  depends  crucially  on  the 
values  used  for  the  elements  of  W  and  on  the  functional 
form  of  z.  Several  methods  for  selecting  the  form  of  z 
have  been  proposed  in  the  neural  network  literature, 
and  extensive  theoretical,  simulation,  and  experimen¬ 
tal  studies  have  demonstrated  the  effectiveness  of  these 
methods.  In  what  follows,  we  shall  assume  that  z  is 
constructed  using  either  recurrent  higher  order  neural 
network  (RHONN)  methods  [e.g.,  11]  or  Gaussian  rar 
dial  basis  function  network  (GRBFN)  techniques  [e.g., 
12].  The  suitability  of  using  either  of  these  choices  for 
z  is  indicated  in  the  following  lemma. 

Lemma  1:  Suppose  that  the  RNN  (3)  represents  ei¬ 
ther  a  RHONN  or  a  GRBFN  (depending  on  the  choice 
of  z)  and  that  the  RNN  (3)  and  the  dynamical  system 
(2a)  are  initially  at  the  same  state  (i.e.,  x(0)  =  xn(0)). 
Then  for  any  e  >  0  and  any  T  >  0  there  exists  an  inte¬ 
ger  p  and  a  matrix  W*  such  that  the  network  (3)  with 
W  =  W*  satisfies  ||  x(f)  —  xn(t)  ||<  e  for  all  t  E  [0,T]. 
Proof:  The  proof  is  based  on  the  Stone- Weierstrass 
theorem,  and  is  given  for  RHONN  in  [11]  and  can  be 
obtained  for  GRBFN  through  a  straightforward  gen¬ 
eralization  of  the  results  in  [11].  n 

This  result  shows  that,  if  a  sufficient  number  of 
interconnections  are  permitted  in  the  network,  the  dy¬ 
namical  system  (2a)  can  be  modeled  with  arbitrary  ac¬ 
curacy  using  either  RHONN  or  GRBFN  models.  Ob¬ 
serve  that  this  is  an  existence  result,  however,  and  that 
it  does  not  prescribe  a  method  for  obtaining  a  weight¬ 
ing  matrix  W*  which  ensures  that  the  desired  accuracy 
^  ”  ^^s  obtained.  This  latter  problem  is  considered  in  the 


following  section,  where  the  proposed  tool  wear  esti¬ 
mation  algorithm  is  developed. 

Finally,  we  note  that  our  approach  to  the  design 
and  analysis  of  both  the  RNN  and  observer  compo¬ 
nents  of  the  tool  wear  estimation  algorithm  is  based 
on  Lyapunov  stability  theory.  The  following  lemma 
establishes  a  useful  result  concerning  a  certain  class  of 
Lyapunov  functions  and  will  be  of  direct  relevance  in 
this  development. 

Lemma  2:  Consider  the  coupled  dynamical  system 
Xi  =  fi(xi,x2,*),  x2  =  f2(xi,x2,t).  Let  7(xi,x2,<) 
be  a  Lyapunov  function  for  the  system  with  the  prop¬ 
erties 

II  Xi  II2  +A2  ||  x2  ||2<  V  <  A3  II  X:  II2  +A4  ||  x2  II2 
v  <  — ||  Xi  II2  -A6  II  x2  II2  +e 

where  e  and  the  A*  are  positive  scalar  constants.  De¬ 
fine  S  =  max(A3/A5,  A4/A6)  and  n  =  (Se/X ,)1/2  for 
i  =  1,2.  Then  for  any  initial  state  xi(0),x2(0)  the 
system  will  evolve  so  that  xi(f),x2(f)  are  uniformly 
bounded  and  converge  exponentially  to  the  closed  balls 
BriJ  Br2 ,  respectively,  where  Bri  =  {x,-  :  ||  xt  ||<  rt*} 
(see  [13]  for  a  discussion  of  exponential  convergence  to 
a  closed  ball). 

Proof:  The  proof  is  based  on  an  extension  of  the 
global  exponential  convergence  theorem  of  Corless  [13] 
and  is  given  in  [14].  □ 

3.  Tool  Wear  Estimation  Algorithm 

The  proposed  tool  wear  estimation  algorithm  con¬ 
sists  of  two  components:  a  RNN  and  a  robust  observer. 
In  what  follows,  each  of  these  subsystems  is  described 
and  analyzed  in  detail.  The  performance  of  the  com¬ 
plete  system  is  then  illustrated  through  a  case  study 
in  Section  4. 

3.1  Neural  Network  Model 

Consider  the  problem  of  approximating  the  dy¬ 
namical  system  (with  outputs)  given  in  (2)  using  a 
RNN.  First  note  that  modeling  the  output  equation 
(2b)  requires  only  that  the  unknown  constant  vectors 
a,b  be  determined.  Since  (2b)  is  linear  in  these  un¬ 
known  parameters,  there  are  many  techniques  avail¬ 
able  for  estimating  them  and  thus  for  approximating 
(2b).  For  example,  suppose  that  a  £  and  b  £  5R2 
are  estimates  for  a  and  b,  and  that  these  estimates 
are  computed  adaptively  by  comparing  the  evolution 
of  the  output  F(f)  of  the  actual  system  (2)  with  the 
“predicted”  output  F(t)  =  au  +  bx.  One  means  of 
calculating  these  estimates  based  on  the  discrepancy 
between  F(f)  and  -F(f)  is  to  utilize  the  following  adap-^ 


tive  identification  scheme: 

a  =  y(F  -  F) u 

b  =  y(F  —  F)x  (4) 

where  7  is  a  positive  scalar  constant.  The  suitability  of 
this  adaptive  approach  to  estimating  a,  b  is  indicated 
in  the  following  lemma. 

Lemma  3:  Suppose  that  the  input  u(<)  to  the  dy¬ 
namical  system  (2)  is  chosen  so  that 

/t+T 

y(r)yT(r)dr  >  aIm+2 

for  all  t  >  0,  where  y  =  [uT  xT]T  £  3?m+2,  /m+2  is 
the  (m+2)  x  (m+  2)  identity  matrix,  and  T} a  are 
positive  constants.  Then  the  estimation  scheme  (4) 

ensures  that  a  — ►  a  and  b  — ►  b  as  t  — ►  00  for  any 
initial  a(0)  and  b(0). 

Proof:  Let  c  =  [aT  bT]T  and  c  =  [aT  bT]T,  so 
that  F  =  cTy  and  F  =  cTy.  Consider  the  Lyapunov 
function  candidate  V\  =  <f>T<j>/ 2,  where  <j)  =  c  —  c.  Dif¬ 
ferentiating  Vi  along  the  trajectories  of  (4)  yields,  after 
simplification,  V\  —  —y<f>TyyT<f>.  Standard  arguments 
from  differential  equation  stability  theory  then  show 
that,  if  the  condition  on  u  given  in  the  statement  of 
the  lemma  is  satisfied,  then  <j>  0  as  t  — ►  00  and  thus 

the  desired  estimates  for  a,b  are  obtained  [15].  o 

Using  the  preceding  result,  it  is  straightforward  to 
obtain  accurate  estimates  for  a  and  b;  for  example,  it  is 
relatively  easy  to  ensure  that  the  condition  on  y  stated 
in  Lemma  3  is  satisfied  during  a  “training”  period  for 
the  algorithm.  Thus  it  what  follows  it  is  assumed  that 
a  and  b  have  been  identified  in  this  manner,  so  that  the 
problem  of  modeling  the  dynamical  system  (2)  using 
a  RNN  reduces  to  the  problem  of  approximating  (2a) 
using  such  a  network.  This  approximation  process  in¬ 
volves  selecting  the  RNN  weights  W  and  functions  z  in 
such  a  way  that  the  response  of  the  RNN  closely  mod¬ 
els  the  response  of  (2a)  if  each  system  is  subjected  to 
the  same  inputs.  Lemma  1  indicates  that  an  appropri¬ 
ate  choice  for  z  can  be  obtained  using  either  RHONN 
methods  or  GRBFN  techniques.  Determining  suitable 
values  for  the  elements  of  W  is  ordinarily  accomplished 
by  presenting  a  set  of  input  trajectories  to  both  the 
dynamical  system  to  be  modeled  and  the  RNN,  and 
adjusting  the  weighting  matrix  W  in  such  a  way  that 
the  discrepancy  between  the  state  trajectories  for  the 
two  systems  is  reduced;  this  process  is  often  termed 
“training”  the  RNN.  In  what  follows,  we  propose  a 
robust  training  procedure  for  the  RNN  given  in  (3). 
^2  If  can  be  seen  from  Lemma  1  that,  if  z  is  chosen 
so  that  (3)  is  in  RHONN  or  GRBFN  (with  sufficient 


interconnections  p)  then  there  exists  a  choice  for  W , 
say  W* ,  for  which  (3)  provides  a  close  approximation 
of  (2a).  In  other  words,  given  this  choice  for  Wy  the 
dynamical  system  (2a)  can  be  written  as 

x  =  —Ax  +  W*z(x ,  u)  ■+•  7/(x,  u)  (5) 

where  rj  :  3J2  x  3 — ►  3i2  quantifies  the  (small)  mod¬ 
eling  error.  The  difficulty,  of  course,  is  determining 
W *.  A  robust  approach  for  specifying  W ,  and  thereby 
training  the  RNN,  is  proposed  in  the  following  theo¬ 
rem. 

Theorem  1:  Consider  the  following  strategy  for  de¬ 
termining  W: 

x„  =  —Axn  +  Wz(x,  u)  (6a) 

W=  -aW  +  /3ezT  (66) 

where  e  =  x— xn  and  o,  j3  are  positive  scalar  constants. 
This  strategy  ensures  that  e  and  W  =  W*  —  W  are 
uniformly  bounded  for  all  model  mismatches  77,  and 
that  the  ultimate  bound  on  e  is  small  if  77  is  small. 
Proof:  The  identification  error  dynamics  can  be  ob¬ 
tained  by  differencing  (5)  and  (6a): 

e  =  —Ae  -f  Wz  -f  rj  (7) 

Consider  the  Lyapunov  function  candidate 

V2  =  ±eTe  +  ±tr[WWT]  (8) 


to  a  given  input  trajectory  should  provide  an  accurate 
approximation  of  the  response  of  the  actual  system  to 
this  input  trajectory.  Thus  this  RNN  could,  in  princi¬ 
ple,  be  used  to  give  an  estimate  for  tool  wear  during 
machining.  However,  it  is  highly  desirable  to  introduce 
a  feedback  mechanism  into  this  estimation  algorithm, 
so  that  the  wear  estimate  is  influenced  by  the  evolu¬ 
tion  of  the  actual  system.  One  means  of  accomplishing 
this  goal  is  described  in  the  next  section. 

3.2  Robust  Observer 

Let  Wc  E  ^xp  denote  the  matrix  of  constants 
obtained  by  training  the  RNN  (3)  using  the  robust 
identification  scheme  summarized  in  Theorem  1.  If 
the  results  given  in  Lemma  1,  Lemma  3,  and  Theorem 
1  are  used  to  construct  the  RNN  model  of  the  tool 
wear  dynamics  (2),  then  this  dynamical  system  can  be 
rewritten  as 

x  =  -  Ax  -f  Wcz(x,  u)  -f  t/(x,  u)  (9a) 
F*  =  bTx  (96) 

where  F*  =  F  —  aTu.  In  (9),  77  is  unknown  but  small 
and  A,  WC)  z,  b,  and  F*  are  known  (observe,  for  exam¬ 
ple,  that  F*  is  known  because  F  can  be  measured,  u 
is  known  and  a  has  been  identified). 

Our  objective  is  now  to  develop  an  observer  which 
can  asymptotically  reconstruct  the  tool  wear  state  x 
from  measurements  of  F*  despite  the  system  uncer¬ 
tainty  quantified  by  77.  Note  first  that  the  original 
system  (2)  is  observable  [9],  so  that  this  objective  is 
attainable.  Consider  the  observer 


Differentiating  (8)  along  (7)  and  simplifying  yields 


x0  =  —  Ax0  -f  Wc z(x0,  u)  +  L(F*  -  bTx0)  (10) 


w  Hf.  +- 


w  jjj. 

2/? 


where  am  =  min(at),  ijmax  is  an  upper  bound  on  || 
rj  ||,  and  ||  •  ||f  is  the  Frobenius  norm.  Application 
of  Lemma  2  then  permits  the  conclusion  that  e,  W 
are  uniformly  bounded  and  are  guaranteed  to  converge 
exponentially  to  a  compact  neighborhood  of  the  origin 
in  e,  W  space.  A  bound  on  the  ultimate  size  of  ||  e  || 
can  also  be  obtained  using  Lemma  2;  if  f3  is  chosen 
reasonably  large  then  the  size  of  this  ultimate  bound 
is  proportional  to  the  size  of  the  model  mismatch  r)max , 
so  that  e  is  small  if  77  is  small.  □ 

The  results  summarized  in  Lemma  1,  Lemma  3, 
and  Theorem  1  define  a  strategy  for  modeling  the  un¬ 
certain  tool  wear  dynamical  system  (2)  using  a  RNN 
together  with  an  (adaptively  obtained)  output  map.  If 
the  conditions  stipulated  in  Lemma  1  and  Theorem  1^ 
are  satisfied,  the  response  of  the  state  xn  of  the  RNN 


where  xQ  E  3 ft2  is  the  estimate  of  the  wear  state  x  and 
L  E  9i2xl  is  the  observer  gain  matrix  to  be  specified 
later.  Applying  (10)  to  (9)  yields 

e  =  -(A  +  LbT)e  +  We[ z(x)  -  z(x0)]  + 17  (11) 

where  e  =  x  —  x0  is  the  state  estimation  error  and  the 
dependence  of  z  on  u  has  been  supressed  for  clarity. 
The  performance  characteristics  of  the  observer  (10) 
can  be  determined  by  studying  the  stability  proper¬ 
ties  of  (11).  The  following  fact  concerning  the  pair  of 
matrices  (A,  b)  will  be  helpful  in  this  regard. 

Lemma  4:  If  the  RNN  is  designed  so  that  the  diagonal 
elements  of  A  are  not  equal  (this  is  always  possible) 
then  the  eigenvalues  of  (A  +  LbT)  can  be  arbitrarily 
assigned  through  the  specification  of  L. 

Proof:  It  is  trivial  to  verify  that  the  matrix  [bT  ATb] 
^has  full  rank,  and  the  assignability  of  the  eigenvalues 
183of  (A  +  LbT)  follows  directly  from  this  [16].  o 


The  following  theorem  characterizes  the  perfor¬ 
mance  capabilities  of  the  observer  (10).. 

Theorem  2:  The  observer  (10)  ensures  that  the  state 
estimation  error  e  is  uniformly  bounded,  and  has  an 
ultimate  bound  which  can  be  reduced  arbitrarily,  pro¬ 
vided  that  L  is  properly  specified. 

Proof:  Suppose  that  L  is  chosen  so  that  A  =  Amin(A+ 
LbT )  =  A max{A  +  LbT)/2  is  a  large  positive  con¬ 
stant.  Let  A  =  diag(A  2A)  and  M~x  denote  the 
associated  (normalized)  modal  matrix,  so  that  A  = 
M(A  +  LbT)Af~1.  Then  defining  q  =  Me  permits 
(11)  to  be  rewritten 

q  =  — Aq  +  MWc[z(x)  -  z(x0)]  +  Mi)  (12) 

Consider  the  Lyapunov  function  candidate  V3  = 
qTq/2.  Differentiating  Vs  along  the  trajectories  of  (12) 
yields,  after  simplification 

Vs  <  —(A  —  K  \\M  ||,p||  We  ||f)  11  q  ||2 
+  l|M|Hblll|q|| 

where  K  >  0  satisfies  K  ||  M(x  —  x0)  ||>||  z(x)  — 
z(x0)  ||  Vx,x0  (the  existence  of  such  a  AT  is  a  con¬ 
sequence  of  the  fact  that,  for  RHONN  and  GRBFN 
designs,  the  partial  derivatives  of  z  are  all  bounded). 
Assume  that  L  is  chosen  so  that  \o  =  \  —  K  \\  M  \\f 
||  Wc  | |f  is  positive  (this  is  always  possible).  Then 
routine  manipulation  permits  Vs  to  be  upper  bounded 
as 

V3<-(X0  ||  q  ||  -Tlmax  ||  M  ||F)  ||  q  || 

From  Lemma  2  it  can  now  be  concluded  that  ||  q  || 
is  uniformly  bounded  and  that  the  ultimate  bound  on 
this  error  is  ||  q  ||<  t)max  ||  Af  \\f  /A0;  note  that  the 
ultimate  bound  on  ||  q  ||  (and  therefore  ||  e  ||)  can 
be  decreased  as  desired  by  increasing  Ao  through  the 
specification  of  L.  □ 

4.  Case  Study:  Turning 

The  proposed  tool  wear  estimation  algorithm  was 
evaluated  through  a  case  study  involving  a  turning 
application.  In  this  case  study,  AISI  medium  carbon 
steel  cylinders  having  223  BHN  hardness  were  turned 
using  a  CNC  lathe  with  Valenite  VC55  TNMA432  un¬ 
coated  cemented  carbide  tools  [9].  Cutting  force  mea¬ 
surements  were  obtained  for  all  trials,  and  the  fol¬ 
lowing  cutting  conditions  were  used  throughout  the 
case  study  [9]:  cutting  speed  =  200m/min,  feed  = 
0.15mm/rev,  and  depth  of  cut  =  1.27mm. 

The  tool  wear  estimation  algorithm  utilized  in  this 
case  study  consisted  of  an  RHONN  of  the  form  given  in 
(3)  together  with  the  state  observer  (10).  The  RHONN  2 


structure  was  determined  using  the  design  procedure 
given  in  [11].  The  network  was  trained  using  data 
generated  through  computer  simulation  using  the  tool 
wear  model  proposed  in  [10],  In  each  trial  only  two 
cutting  experiments  were  simulated,  so  that  the  train¬ 
ing  data  sets  were  quite  modest.  The  parameter  val¬ 
ues  used  in  the  observer  (10)  were  obtained  after  a  few 
iterations  using  computer  simulation  data,  and  no  at¬ 
tempt  was  made  to  “tune”  the  parameters  to  achieve 
optimal  performance.  Instead,  two  sets  of  observer 
parameters  were  used:  a  “low  gain”  observer  which 
resulted  in  a  moderate  magnitude  for  A,  and  a  “high 
gain”  observer  which  placed  the  magnitude  of  A  at  a 
large  value.  All  details  concerning  the  implementation 
of  the  proposed  estimation  algorithm,  including  the 
structure  of  the  RHONN  and  the  parameter  values  for 
the  network  and  the  observer,  are  given  in  [17], 

To  evaluate  the  performance  of  the  “trained”  esti¬ 
mation  algorithm,  the  algorithm  was  tested  using  the 
experimental  data  obtained  by  Park  and  Ulsoy  in  [9]. 
More  specifically,  the  force  measurement  trajectories 
reported  in  [9]  were  used  as  inputs  to  the  wear  state 
estimation  algorithm  to  generate  estimated  tool  wear 
trajectories.  The  estimates  for  the  flank  wear  com¬ 
ponents  due  to  abrasion  (t u/J  and  diffusion  (ti?/a)  for 
a  typical  trial  with  the  high  gain  observer  are  shown 
in  Figure  1;  estimates  obtained  using  the  low  gain 
observer  were  similar  and  hence  are  not  shown.  The 
evolution  of  the  total  tool  wear  wj  obtained  using  the 
high  gain  observer  is  shown  in  Figure  2,  and  it  can  be 
seen  that  this  estimate  agrees  quite  closely  with  the 
experimental  measurements  made  by  Park  and  Ulsoy 
[9], 

5.  Conclusions 

This  paper  presents  a  robust  strategy  for  esti¬ 
mating  tool  wear  in  metal  cutting  operations.  The 
proposed  estimation  algorithm  consists  of  a  recurrent 
neural  network  to  model  the  tool  wear  dynamics,  and 
a  state  observer  to  estimate  the  tool  wear  from  this 
model  using  measurements  of  cutting  force.  A  sta¬ 
bility  analysis  is  performed  to  demonstrate  that  the 
algorithm  provides  uniformly  bounded  tool  wear  esti¬ 
mation  error  and  that  the  ultimate  bound  on  this  error 
can  be  reduced  arbitrarily.  The  proposed  approach  is 
applied  to  the  problem  of  estimating  tool  wear  in  a 
turning  application  and  is  shown  to  provide  wear  es¬ 
timates  which  are  in  close  agreement  with  published 
experimental  results. 
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STABILITY  ANALYSIS  FOR  A 
CLASS  OF  NEURAL  NETWORKS 

R.  Colbaugh* 

Abstract 

This  paper  considers  the  problem  of  characterizing  the  stability  prop¬ 
erties  of  the  equilibria  of  an  important  class  of  recurrent  neural  net¬ 
works.  Sufficient  conditions  are  given  under  which  the  neural  net¬ 
work  possesses  a  unique  globally  asymptotically  stable  equilibrium 
point  for  each  external  input.  These  conditions  are  less  restrictive 
than  those  previously  obtained  and  are  easily  checked,  so  that  incor¬ 
porating  them  in  existing  neural  network  design  procedures  should 
increase  the  flexibility  and  reduce  the  complexity  of  this  synthesis 
process.  Results  are  provided  for  both  continuous-time  and  discrete- 
time  networks. 
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1.  Introduction 

A  recurrent  neural  network  is  a  dynamical  system  com¬ 
posed  of  a  large  number  of  highly  interconnected  neuron¬ 
like  subsystems.  The  structure  of  these  networks  leads 
naturally  to  distributed  information  processing  and  an  in¬ 
herent  potential  for  efficient  parallel  computation.  As  a 
consequence,  neural  networks  have  attracted  considerable 
interest  as  candidates  for  robust,  high-performance  com¬ 
putational  systems.  Recognizing  the  potential  advantages 
of  these  systems  for  solving  complex  problems  in  real  time, 
many  researchers  have  studied  the  application  of  neural 
networks  to  a  wide  variety  of  automation  problems.  For 
example,  neural  networks  have  been  implemented  to  solve 
problems  in  optimization  (e.g.,  [1-4]),  control  theory  (e.g., 
[5-7]),  robotics  (e.g.,  [8-10]),  and  pattern  recognition  (e.g., 
[11-13]). 

One  of  the  most  popular  and  useful  classes  of  neural 
networks  proposed  to  date  is  the  additive  or  Hop  field-type 
network  [1,  2].  Fundamental  to  the  successful  implemen¬ 
tation  of  this  class  of  neural  network  as  a  solution  to  au¬ 
tomation  problems  is  an  understanding  of  the  qualitative 
properties  of  the  system  dynamics.  Of  particular  interest 
for  the  design  and  utilization  of  these  neural  networks  is 
a  characterization  of  the  network  equilibria,  including  the 
relationship  between  the  external  inputs  and  the  resulting 
equilibria  and  the  stability  properties  of  these  equilibria. 


In  many  applications  it  is  desirable,  or  even  essential,  to 
have  a  unique  globally  asymptotically  stable  equilibrium 
corresponding  to  each  external  input.  Thus  in  these  cases 
it  is  of  great  interest  to  the  designer  to  have  conditions 
on  the  neural  network  parameters  that  ensure  that  this 
situation  is  realized. 

Understanding  the  importance  of  this  problem,  many 
researchers  have  studied  the  qualitative  properties  of  the 
dynamics  of  additive  neural  networks  (e.g.,  [1,  2,  14-21]). 
Much  of  the  early  work  in  this  area  imposed  restrictive 
conditions  on  the  network  components  and  architecture 
to  ensure  the  existence  of  asymptotically  stable  equilibria. 
For  example,  a  common  assumption  was  that  the  network 
possessed  a  symmetric  connectivity  between  the  neurons, 
and  that  the  neuron  input/output  “activation”  functions 
were  smooth  (e.g.,  [1,  2,  14,  21]).  These  conditions  have 
been  found  to  impose  serious  limitations  on  the  capabili¬ 
ties  of  neural  networks,  and  as  a  consequence  recent  inves¬ 
tigations  have  sought  to  relax  these  restrictions  while  still 
ensuring  the  existence  of  asymptotically  stable  equilibria 
[15-20].  Most  of  this  work  has  focused  on  local  stabil¬ 
ity,  so  that  conditions  for  global  asymptotic  stability  of 
an  equilibrium  that  are  not  restrictive  and  that  can  be 
easily  checked  are  still  lacking.  Additionally,  most  of  the 
research  on  the  qualitative  dynamics  of  neural  networks 
has  considered  the  continuous- time  case,  despite  the  fact 
that  discrete-time  networks  have  proven  to  be  quite  useful 
in  applications. 

This  paper  considers  the  problem  of  characterizing 
the  stability  properties  of  the  equilibria  of  additive  neural 
networks.  Sufficient  conditions  are  given  under  which  the 
neural  network  possesses  a  unique  globally  asymptotically 
stable  equilibrium  point  for  each  external  input.  These 
conditions  are  simple  and  easily  checked,  and  are  less  re¬ 
strictive  than  those  obtained  in  other  investigations.  For 
example,  it  is  not  assumed  that  the  connectivity  between 
the  neurons  is  symmetric,  and  the  neuron  activation  func¬ 
tions  are  required  only  to  be  locally  Lipschitz.  The  focus 
of  the  work  is  on  continuous-time  neural  networks;  how¬ 
ever,  results  are  also  provided  that  ensure  the  analogous 
behaviour  in  the  discrete-time  case. 

2.  Preliminaries 


The  objective  of  this  paper  is  to  investigate  the  stabil¬ 
ity  properties  of  additive  neural  networks.  A  continuous¬ 
time  realization  of  this  class  of  networks  typically  takes 
the  form: 
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inhibitions  of  the  neurons  (so  that  the  a,-  are  positive  con¬ 
stants),  W  6  3tnXn  is  a  constant  matrix  whose  elements 
Wij  define  the  interconnection  strengths  between  the  neu¬ 
rons,  g  =  [^i(xi),^2(^2),**-J^n(:cn)]T  :  3tn  —►  3£n  is  a  lo¬ 
cally  Lipschitz  mapping  representing  the  input/output  ac¬ 
tivation  functions  of  the  neurons,  and  I  £  5£n  is  the  vector 
of  external  inputs  to  the  network.  In  what  follows  it  will 
be  assumed  that  the  functions  gi  are  bounded  and  strictly 
increasing,  and  that  the  solutions  of  (1)  are  defined  for  all 
time  and  exhibit  continuous  dependence  on  initial  condi¬ 
tions. 

In  discrete  time,  the  additive  neural  network  model  is 
ordinarily  taken  to  be: 

Xjt+i  =  Wg(xfc)  +  I  (2) 

where  all  terms  are  defined  as  in  the  continuous-time 
model.  This  representation  is  usually  referred  to  as  a 
synchronous  discrete-time  network,  and  we  note  that  this 
version  possesses  implementation  advantages  over  models 
that  switch  asynchronously  [19]. 

In  what  follows,  we  shall  be  concerned  with  the  sta¬ 
bility  properties  of  the  equilibria  of  the  neural  network 
models  (1),(2).  Therefore,  an  important  component  of  the 
analysis  consists  in  establishing  the  existence  of  an  equi¬ 
librium  solution  for  a  nonlinear  differential  or  difference 
equation.  The  next  two  lemmas  will  be  helpful  in  this 
regard. 

Lemma  1:  Consider  the  continuous-time  dynamical 
system: 

x=  Bx-ff(x)  (3) 

where  B  £  3 Jnxn  is  constant  and  nonsingular  and  f  :  3tn  — ► 
3 in  is  locally  Lipschitz.  If  there  exists  a  constant  K  >  0 
such  that  ||  f(x)  ||<  I<  Vx  then  (3)  has  an  equilibrium 
solution  xeq  (i.e.,  xeq  satisfies  Bxeq  +f(xC9)  =  0). 

Proof:  We  seek  a  solution  to  Bx  +  f(x)  =  0  or,  equiv¬ 
alently,  to  the  fixed-point  problem: 

x=  -B'lf(x)  (4) 

Now  the  boundedness  of  f  and  invertibility  of  B  ensures 
the  existence  of  a  constant  R  >  0  such  that  ||  Z?_1f(x)  ||< 
R  Vx.  Thus  5_1f(x)  maps  the  ball  £#(())  (of  radius  R 
centred  at  the  origin)  into  itself,  and  the  Brouwer  fixed- 
point  theorem  then  ensures  that  B-1f(x)  has  a  fixed  point 
in  .Br(O)  [22],  This  fixed  point  is  an  equilibrium  solution 
for  (1).  QED 

Lemma  2:  Consider  the  discrete-time  dynamical  sys¬ 
tem: 

xjfc+i  =  f(xt)  (5) 

where  f  :  3Jn  — ►  3Jn  is  continuous.  If  there  exists  a  con¬ 
stant  R  >  0  such  that  ||  f(x)  ||<  R  Vx  then  (5)  has  an 
equilibrium  solution  xc^  (i.e.,  xtq  satisfies  xeq  =  f(xc?)). 

Proof:  Here  we  have  immediately  a  fixed-point  prob¬ 
lem,  so  that  the  boundedness  of  f(x)  and  the  Brouwer 
fixed-point  theorem  again  ensure  the  existence  of  a  fixed- 
point  (in  Br(0))  and  therefore  an  equilibrium  solution  for 
(5).  QED 


Our  approach  to  analyzing  the  stability  of  the  equilib¬ 
rium  solutions  to  the  dynamical  systems  (1)  and  (2)  is  to 
utilize  Lyapunov’s  direct  method  [22].  The  following  two 
lemmas  summarize  certain  facts  regarding  this  method  of 
stability  analysis  and  will  be  of  direct  relevance  in  our  de¬ 
velopment.  The  first  lemma  recalls  a  useful  result  concern¬ 
ing  the  stability  of  continuous-time  dynamical  systems. 

Lemma  3:  Consider  the  continuous-time  dynamical 
system  x  =  f(x),  where  f  :  3in  — ►  3 in  is  locally  Lips¬ 
chitz  and  such  that  f(0)  =  0.  If  there  exists  a  continu¬ 
ously  differentiable  function  V  :  3in  — ►  3£  that  is  positive- 
definite,  radially  unbounded,  and  such  that  the  Lie  deriva¬ 
tive  V  =  (dV/dx)f  is  negative-definite,  then  the  equilib¬ 
rium  solution  x  =  0  is  globally  asymptotically  stable. 

Proof:  The  proof  can  be  found  in,  among  other 
sources,  [23].  QED 

.The  next  lemma  provides  the  analogous  result  for 
discrete-time  dynamical  systems. 

Lemma  4:  Consider  the  discrete-time  dynamical  sys¬ 
tem  xjb+i  =  f(xjt),  where  f  :  3?n  — ►  3£n  is  continu¬ 
ous  and  such  that  f(0)  =  0.  If  there  exists  a  continu¬ 
ous  function  V  :  3£n  — ►  3i  that  is  positive-definite,  ra¬ 
dially  unbounded,  and  such  that  the  forward  difference 
A 14  =  l/(xjb+i)  —  Vr(xt)  is  negative-definite,  then  the  equi¬ 
librium  solution  x  =  0  is  globally  asymptotically  stable. 

Proof:  The  proof  can  be  found  in,  among  other 
sources,  [23].  QED 


In  this  section  we  investigate  the  stability  properties  of  the 
continuous-time  neural  network  (1).  The  purpose  of  this 
study  is  to  present  sufficient  conditions  under  which  the 
network  possesses  a  unique  globally  asymptotically  stable 
equilibrium  solution  for  each  external  input  I.  Our  main 
results  are  summarized  in  the  following  two  theorems. 

Theorem  1:  The  neural  network  (1)  possesses  a 
unique  globally  asymptotically  stable  equilibrium  solution 
for  each  I  provided  that  the  matrix  W  +  WT  is  negative 
semidefinite. 

Proof:  The  boundedness  of  g  and  invertibility  of  A  to¬ 
gether  with  Lemma  1  permit  the  conclusion  that  (1)  has  an 
equilibrium  solution  xcq .  Introducing  the  new  coordinates 

q  =  x-xe,  and  defining  f(q)  =  [/i(?i),  ...,/n(?n)]T  = 
g(q  +  x^)  —  g(xe?)  allows  (1)  to  be  written  as: 

q=-Aq  +  Wf{q)  (6) 

where  f(0)  =  0  (so  that  q  =  0  is  an  equilibrium  solution 
for  (6))  and  the  /,•  are  locally  Lipschitz,  bounded,  and 
strictly  increasing  (because  the  gi  are). 

Consider  now  the  Lyapunov  function  candidate 


1 


v(q)  =  X)  /  fi(y)dy  (7) 

»=1 

and  observe  that  the  characteristics  of  f  ensure  that  V  is 
continuously  differentiable,  positive-definite,  and  radially 
unbounded.  Computing  the  derivative  of  (7)  along  (6)  and 
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simplifying  yields:  _ 

V  =  -fTAci  +  ifT(I V  +  WT)  f 

<  — min(at)  ||  f  ||||  q  ||  +  l-\mai{W  +  WT)\\  f  ||2 

The  negative-semidefiniteness  of  W  +  WT  implies  that 
^ mar  (W  +  WT )  <  0,  so  that  V  is  negative-definite  and 
the  conclusions  of  the  theorem  then  follow  from  Lemma  3. 
QED 

The  condition  that  all  of  the  eigenvalues  of  the  ma¬ 
trix  W  -\-WT  must  be  nonpositive  can  be  relaxed  if  there  is 
some  additional  information  available  regarding  the  func¬ 
tions  <7i(x*);  this  result  is  the  subject  of  the  next  theorem. 

Theorem  2:  Consider  the  continuous-time  neural  net¬ 
work  (1),  rewritten  as  in  (6).  Suppose  that,  in  addition  to 
the  previous  conditions  on  f(q),  it  is  known  that  the  func¬ 
tions  fi(qi)  satisfy  |/,*(<7i)|  <  for  some  positive  scalar 
constants  cx*.  Let  amin  =  min(ax)  and  cmax  =  max(c,-).  If 
the  matrix  W  is  such  that  \max{W  +  WT )  <  2 amin/cmax 
then  (6)  (and  therefore  (1))  possesses  a  unique  globally 
asymptotically  stable  equilibrium  solution  for  each  I. 

Proof:  Differentiating  the  Lyapunov  function  candi¬ 
date  (7)  along  the  solution  trajectories  of  (6)  and  sim¬ 
plifying  as  in  the  proof  of  Theorem  1  gives  the  following 
upper  bound  on  V: 

v  <  -amin  ||  q  nil  f  ||  +  jA max{W  +  WT)  ||  f  ||2 

Thus  V  is  negative-definite  if  the  condition  on  W  stated  in 
the  theorem  is  satisfied,  and  the  conclusions  of  the  theorem 
then  follow  from  Lemma  3.  QED 

Several  observations  can  be  made  concerning  the  re¬ 
sults  summarized  in  Theorems  1  and  2.  First,  note  that 
the  global  nature  of  the  stability  conclusions  implies  that 
there  is  a  unique  equilibrium  solution  for  each  external  in¬ 
put  I;  indeed,  it  is  easy  to  see  that  the  map  F  :  3tn  — ►  3fn 
taking  input  vectors  I  to  equilibria  is  a  bijection.  This 
latter  result  is  important  for  many  applications.  Next, 
observe  that  the  conditions  on  the  neural  network  param¬ 
eters  and  architecture  that  guarantee  global  convergence 
to  unique  equilibria  are  less  restrictive  than  those  obtained 
previously  (e.g.,  [1,  2,  14-21]).  For  example,  it  is  not  as¬ 
sumed  that  the  interconnection  matrix  W  is  symmetric  or 
that  the  activation  function  g  is  differentiable.  Addition- 
ally,  it  can  be  seen  that  these  conditions  are  easily  checked. 
These  latter  features  can  be  useful  for  increasing  the  flex¬ 
ibility  and  reducing  the  complexity  of  the  neural  network 
design  process. 

4.  Discrete-Time  Neural  Networks 

In  this  section  we  study  the  stability  properties  of  the  equi-  2 
libria  of  the  discrete- time  neural  network  (2).  As  in  the 
previous  section,  the  objective  is  to  present  sufficient  con¬ 
ditions  for  the  existence  of  a  unique  globally  asymptoti¬ 


cally  stable  equilibrium  solution  for  each  external  input  I. 
Observe  first  that  from  the  boundedness  of  g  and  Lemma  2 
it  can  be  concluded  that  (2)  has  at  least  one  equilibrium 
solution  for  each  I.  Utilizing  the  coordinate  q  and  the 
function  f(q)  introduced  in  the  previous  section  permits 
(2)  to  be  written  in  the  following  form: 

qjt+i  =  WT(qfc)  (8) 

where  again  f(0)  =  0  so  that  q  =  0  is  an  equilibrium 
solution. 

An  immediate  observation  is  that  if  Wf  is  a  con¬ 
traction  mapping,  so  that  ||  WT(qi)  -  Wf(q2)  ||<  *i- 
II  qi  —  <12  ||  for  some  positive  constant  kx  <  1  and  for 
all  qi,q2,  then  the  origin  is  the  unique  globally  asymp¬ 
totically  stable  equilibrium  solution  corresponding  to  the 
given  input  I.  This  situation  occurs,  for  example,  if  f  is 
globally  Lipschitz  with  Lipschitz  constant  jfc2  >  0  and  the 
maximum  singular  value  <rmax  of  W  satisfies  <rmax  <  1  /Jb2. 
Although  this  condition  is  easily  checked,  it  is  too  restric¬ 
tive  for  many  applications.  An  alternative  condition  can 
be  established  using  Lyapunov’s  direct  method;  it  is  the 
content  of  the  next  theorem. 

Theorem  3:  Consider  the  discrete-time  neural  network 
(8)  (or,  equivalently,  (2))  -and  suppose  that  the  /*  satisfy 
!/,*(#)  1  <  ct-|gt-|  for  some  scalar  constants  c{  >  0.  Define 
C  =  In  —  D,  where  In  is  the  n  x  n  identity  matrix  and 
the  elements  cfXJ-  of  D  are  given  by  d{j  ~  \wij\cj.  If  the 
diagonal  elements  of  W  and  C  are  nonzero  and  positive, 
respectively,  and  the  successive  principle  minors  of  C  are 
all  positive,  then  the  origin  is  a  globally  asymptotically 
stable  equilibrium  solution  of  (8). 

Proof:  Define  a  vector  of  positive  scalar  constants 
A  G  3in  and  denote  by  |q|  the  vector  of  absolute  values 
of  elements  of  q,  that  is,  jq|  =  [|?i  |, kn|]T-  Consider 
the  Lyapunov  function  candidate: 

^(qfc)  =  AT|qjfc|  (9) 

and  note  that  V  is  continuous,  positive-definite,  and  radi¬ 
ally  unbounded.  Computing  the  forward  difference  of  V 
along  the  solutions  of  (8)  yields,  after  simplification: 
and  note  that  V  is  continuous,  positive-definite,  and  radi¬ 
ally  unbounded.  Computing  the  forward  difference  of  V 
along  the  solutions  of  (8)  yields,  after  simplification: 

A  Vi  =  AT(|WT(qi)|  -  |q*|) 
<AT(|^||[cqt]|-|qfc|) 

=  -ATC|q*| 

where  c  =  [c,, ...,  c„]T  and  the  notation  [uw]  = 

[ujuii, ...,  u„wn]T  €  3ft”  is  introduced.  The  conditions  on 
the  matrix  C  given  in  the  statement  of  the  theorem  are 
sufficient  to  ensure  that  the  row-vector  A TC  had  all  posi¬ 
tive  elements  [e.g.,  24].  Thus  A14  is  negative-definite  and 
Lemma  4  implies  that  the  origin  is  a  globally  asymptoti- 
_  ]$$  cally  stable  equilibrium  solution  for  (8).  QED 

The  observations  made  regarding  the  stability  results 
for  continuous- time  neural  networks  apply  in  the  discrete- 
time  case  as  well.  For  example,  it  can  again  be  seen  that 
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the  map  F  :  3Jn  -*•  ft"  taking  input  vectors  I  to  equilib- 
ria  xe,  is  bijective,  which  is  useful  for  many  applications. 
Additionally,  the  conditions  on  the  neural  network  param¬ 
eters  and  architecture  that  guarantee  global  convergence 
to  unique  equilibria  are  not  restrictive  and  can  be  checked 
efficiently,  which  can  reduce  the  difficulty  of  the  neural 
network  synthesis  process. 

5.  Conclusion 

This  paper  presents  sufficient  conditions  under  which  ad- 
ditive  neural  networks  possess  a  unique  globally  asymp¬ 
totically  stable  equilibrium  point  for  each  external  input. 
These  conditions  are  simple  and  easily  checked,  and  are 
less  restrictive  than  those  obtained  in  other  studies.  For 
example,  it  is  not  assumed  that  the  connectivity  between 
the  neurons  is  symmetric  or  that  the  neuron  activation 
functions  are  differentiable.  Both  continuous-time  and 
discrete-time  implementations  of  the  additive  neural  net¬ 
work  model  are  considered.  Extensive  computer  simula¬ 
tions  have  been  conducted  to  verify  that  the  conditions 
do  indeed  ensure  convergence  of  the  network  to  the  de¬ 
sired  equilibrium;  these  simulations  are  reported  in  [25], 
Future  research  will  focus  on  the  application  of  recurrent 
neural  networks  to  the  problem  of  estimating  tool  wear  in 
machining  operations  [26]. 
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Material-Cutting  Tool  Interfaces  and  Ion  Implantation 
Research  Task  3.4  Final  Report 


EXECUTIVE  SUMMARY 

The  first  project  objective  to  determine  the  overall  surface  characteristics  of  the  tool 
edge/substrate  interface  and  the  correlated  determination  of  the  chemical  composition  of 
the  surfaces  of  that  interface  was  experimentally  driven,  confined  to  drill  bits  and  finally 
analyzed  using  Raman  spectroscopy.  This  work  addresses  the  drilling  operation  when  special 
additives  have  been  added  to  a  petroleum  based  lubricant.  The  purpose  of  these  special 
additives  is  to  enhance  the  penetration  of  carbon  bearing  liquids  into  the  area  of  the  cutting 
edge  event,  and  to  aid  in  the  generation  of  veiy  hard  carbon  structures,  resulting  in  new  cutting 
parameters  based  on  the  experimental  findings.  Theoretical  arguments  have  shown  how  the  in- 
situ  generation  of  hard  carbon  species  can  increase  tool  life  by  reducing  operating  wear. 
Experimental  verification  of  the  hypothesis  requires  the  generation  of  measurable  amounts  of 
carbon  at  the  tool/substrate  interface.  Follow-on  research  should  focus  on  a  method  to 
generate  a  substantial  amount  of  the  carbonaceous  material  identified  by  spectroscopy  as  being 
possibly  hard  carbon”.  A  planned  for  journal  publication  by  Prof  Tim  Bentley  and  Prof  Leon 
Cox  titled  “Raman  Measurements  of  Carbon  Complexes  Formed  at  the  Tool/Workpiece 
Interface  in  a  Lubricated  Drilling  Operation”  is  included. 

The  second  objective  to  develop,  implement  and  verify  the  results  consists  of  work  by  Prof. 
Leon  Cox  and  Dr.  Amjed  Al-Ghanim  devising  a  methodology  of  collecting,  storing  and 
utilizing  machining  strategy  data  that  utilized  Artificial  Neural  Networks  (ANN’s).  This  work 
was  initiated  in  task  3.1  and  is  included  here  because  it  was  a  concurrent  task  to  absorb  the 
results  of  the  experimental  findings.  The  paper  published  in  ASME  Press  Intelligent 
Engineering  Systems  Through  Artificial  Neural  Networks  titled  “Machining  Strategies 
Collected,  Stored  and  Utilized  with  ANN’s”  is  included. 

The  third  objective  to  determine  the  feasibility  of  utilizing  ion  implanted  cutting  tools 
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Raman  Measurements  of  Carbon  Complexes  Formed 
at  the  Tool/Workpiece  Interface 
in  a  Lubricated  Drilling  Operation 


James  H.  Bentley  and  Leon  Cox 
New  Mexico  State  University 
Las  Cruces,  NM  88003 


ABSTRACT 

Cutting  fluids  have  been  characterized  in  machining  operations  for  lubricity  and/or  cooling. 
The  chemistry  of  cutting  fluids  in  a  drilling  operation  is  examined  by  means  of  analyses  of 
Raman  spectra  of  carbonaceous  materials  present  at  the  cutting  edge  of  a  high-speed  steel  drill 
bit,  with  a  selected  variety  of  cutting  fluids  and  additives. 


I.  INTRODUCTION 

At  some  point  in  product  development,  modem  manufacturing  almost  always  makes  use  of 
discrete  machining  operations,  including  milling,  drilling,  or  turning.  The  efficacy  of  any 
machining  operation  is  dependent  upon  several  operational  parameters,  including  the  nature  of 
the  cutting  fluid,  which  may  be  used  with  or  without  additives.  This  work  addresses  the  drilling 
operation  when  special  additives  have  been  added  to  a  petroleum  based  lubricant.  The  purpose 
of  these  special  additives  is  to  enhance  the  penetration  of  carbon  bearing  liquids  into  the  area  of 
the  cutting  edge  event,  and  to  aid  in  the  generation  of  very  hard  carbon  structures,  resulting  in 
new  cutting  parameters  based  on  the  experimental  findings. 

H  EXAMINATION  OF  THE  PROBLEM 

Drilling  is  a  complex  3 -dimensional  cutting  operation.  The  outer  edge  of  the  drill  bit  produces 
chips  by  shearing  while  the  material  under  the  chisel  edge  is  subjected  to  severe  deformation  by 
great  localized  heat  and  pressure.  In  any  detailed,  microscopic  examination  of  the  events 
occurring  at  the  cutting  edge  of  a  sharp,  lubricated  cutting  tool,  the  subject  rapidly  turns  to  the 
field  of  chemistry.  This  is  so  because  the  events  that  occur  at  the  cutting  edge  are  at  high 
localized  pressure  and  temperature.  Such  conditions,  at  times  with  oxygen  present  and  at  times 
with  little  or  no  oxygen  present,  can  act  as  a  micro  chemical  reactor,  and  result  in  the 
disproportionation  of  organic  lubricant  molecules  and  the  recombination  of  molecular  fragments 
into  products  very  different  from  the  starting  materials.  If  some  of  the  disproportionated 
carbonaceous  material  should  recombine  either  as  adamantane  carbon  or  any  other  type  of 
structurally  complex  carbon  which  is  very  hard  (1),  the  implications  for  machine  tooling  can  be 
extremely  important. 

Cutting  fluids  are  utilized  extensively  in  machifin^  operations  as  coolants  and/or  lubricants. 


1 


When  properly  applied,  cutting  fluids  can  increase  productivity  and  reduce  costs  through  greater 
speeds,  feeds,  and  depths  of  cut  which  are  the  major  factors  determining  tool-life.  The  purpose 
of  cutting  fluids  is  to  increase  tool  life  by  reducing  friction  and  heat.  This  is  done  by  reducing  the 
forces  exerted  by  the  machine  tool,  cooling  the  cutting  zone,  reducing  distortions,  washing  away 
chips,  and  protecting  newly  machined  surfaces  from  environmental  corrosion. 

The  asperities  between  the  cutting  tool  and  workpiece  are  effective  storage  places  for  small 
quantities  of  cutting  fluids  which  are  then  subjected  to  the  severity  of  the  machining  operation. 
The  mechanism  of  the  cutting  fluid  penetrating  to  the  source  of  the  tool/workpiece  interface  has 
been  shown  to  be  capillary  action,  and  is  a  function  of  the  molecular  size  of  the  cutting  fluid 
components,  among  other  characteristics  (4).  Since  molecules  with  high  dipole  moments  are 
very  good  at  inducing  inverted  (and  therefore  attracting)  dipoles  in  highly  polarizable  materials, 
they  should  bind  electrostatically  to  the  surfaces  of  cutting  tool/substrate  events  to  a  greater 
degree  than  hydrocarbon  molecules  having  little  or  no  dipole  moments.  This  is  especially  true  of 
metallic  tools  and  substrates,  since  metals  possess  a  great  deal  of  loosely  held  electron  density, 
which  makes  them  very  easy  to  polarize.  If  the  additive  molecules  are  good  organic  solvents  to 
boot,  they  will  solvate  the  normal  lubricant  molecules,  resulting  in  still  more  hydrocarbon  density 
and  therefore  available  carbon  density  at  the  cutting  edge. 

Theoretical  arguments  have  shown  how  the  in-situ  generation  of  hard  carbon  species  can 
increase  tool  life  by  reducing  operating  wear.  Experimental  verification  of  the  hypothesis  requires 
the  generation  of  measurable  amounts  of  carbon  at  the  tool/substrate  interface. 


Iff.  EXPERIMENT 


General 


Standard,  off-the-shelf,  118  degree,  one  quarter  inch  diameter  high  speed  steel  drill  bits  were 
chosen  for  experimental  use  and  were  plunged  into  the  end  of  one  inch  diameter  AISI  1020  cold 
rolled  steel  bars  in  a  vertical  orientation.  The  vertical  orientation  assured  easy  retention  of  the 
lubricant  and  its  breakdown  products.  The  process  consisted  of  drilling  in  a  standard  deep-hole 
cycle  manufacturing  method.  Deep-hole  drilling  in  steel  was  used  because  it  best  fits  computer 
numerical  controlled  (CNC)  machining  practices  found  in  industry  today.  Further,  the  CNC 
machine  tool  was  instrumented  with  a  load  cell  and  the  forces  of  the  plunge  were  recorded  with 
a  time-based  plotter.  It  was  suspected  that  results  from  the  force  diagrams  might  show  a 
difference  according  to  cutting  fluid.  The  result  of  this  expectation  was  inconclusive  due  to  the 
inadequacies  of  the  load  cell,  and  the  physical  variability  of  the  drilling  operation.  Appendix  A 
contains  the  time  based  plots  where  the  first  force/time  portion  of  the  plot  is  without  cutting  fluid 
and  the  second  is  after  tool  withdrawal  and  the  cutting  fluid  injected  into  the  cavity.  A 
computerized  data  acquisition  system  utilizing  the  National  Instruments  I/O  board  and  the 
LabView  software  to  capture  this  data  was  also  initiated  but  finally  could  not  be  used  because  of 
the  limitations  in  the  software  program. 

Parameters  for  the  drilling  operation  were  calculated  from  the  Metcut  data  handbooks  (3:  p3-5). 
Cutting  speed  was  determined  to  be  90  feet  perirtiriute  and  the  resulting  Speed=1600  rpm.  Each 
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successive  plunge  depth  was  set  for  0.15  inches,  which  is  just  over  one-half  the  diameter  of  the 
tool.  Feed  rate  was  established  at  0.006  inches  per  revolution,  according  to  the  handbook. 

Lubricants  and  Additives 


Lubricating  oils  used  in  machining  operations  normally  consist  of  two  general  types  of  organics  - 
paraffinic  and  aromatic.  Most  oils  are  a  mixture  of  these  materials.  Of  course  many  oils  also  have 
additives  for  special  purposes,  such  as  "clean"  cutting  and  "smooth"  machining.  Since  in  this 
investigation  our  purpose  is  to  discover  adamantane  or  other  very  hard  types  of  carbon,  and 
since  it  is  evident,  a  priori,  that  large  quantities  of  any  kind  of  carbon  do  not  remain  in-situ  at  the 
normal  cutting  tool  interface,  compounds  have  been  added  to  cutting  fluids  whose  effects  may  be 
to  generate  more  such  carbon  than  would  normally  be  generated  in  ordinary  metal  cutting 
operations. 

The  following  comments  on  lubricants  will  list  the  lubricant  according  to  a  number  code,  and 
make  specific  comments  as  to  how  that  particular  lubricant/additive  combination  may  be  unique 
in  its  function.  All  the  mixes  are  50/50  by  volume. 

The  basic  lubricants  consisted  of  a  mixed  hydrocarbon  fraction  taken  from  175-210  degrees  C, 
light  machine  oil,  30  wt.  Oil,  and  cutting  oil  with  sulfur. 

The  additives  were: 

1.  Styrene.  An  aromatic  material  which  has  considerable  unsaturation.  A  relatively  flat,  ring 
molecule  which  can  fit  easily  into  small  crevasses,  this  material  can  also  readily  polymerize  into  a 
linear  polymer  containing  considerable  aromatic  unsaturation. 

2.  Iodomethane.  A  very  simple,  small  molecule  containing  the  halogen,  iodine.  This  covalent 
iodine  compound  will  possess  a  smaller  dipole  moment  that  its  other  counterparts,  and  the  iodine 
atom  is  very  large.  However  the  iodine  atom  is  considerably  polarizable,  and  will  respond  to 
localized  dipole  moments  better  than  the  other  halogens. 

3.  2,4-Pentcmedione.  This  molecule  is  a  strong  covalent  oxygen  chelator  for  positive  centers, 
which  forms  a  flat  structure  when  it  chelates. 

4.  Propargyl  chloride.  A  small  molecule  containing  a  lot  of  non-aromatic  unsaturation  and  a 
large  dipole  moment  developed  by  the  allylic  chlorine.  This  material  will  not  polymerize  under 
normal  conditions. 

5.  S-Methyl-2-butanone.  An  oil  thinning  organic  solvent  containing  a  large  dipole  moment  in  the 
ketonic  carbonyl. 

6.  1,2-Dichlorobenzene.  An  unsaturated,  aromatic  material  with  considerable  molecular  dipole 
moment  in  the  aromatic  nucleus.  This  is  a  relatively  small,  flat,  ring  molecule  which  will  be  able 
to  penetrate  into  small  crevasses. 
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7.  Sulfolane  or  tetramethylene  sulfone.  This  molecule  possesses  covalent  sulfur  in  a  reasonably 
small  ring.  The  sulfur  is  in  an  oxidized  state  and  has  considerable  dipole  moment.  Sulfolane  is 
infinitely  miscible  with  water  and  will  not  mix  as  well  with  regular  lubricants  as  many  of  the  other 
additives  will.  However  it  will  act  to  add  water  to  the  lubricant,  gleaned  from  atmospheric 
sources  or  from  the  results  of  the  high  temperature  breakdown  of  hydrocarbons  at  the  cutting 
interface. 

8.  Acetonitrile.  This  is  a  very  small  molecule  possessing  a  large  dipole  moment  in  the  covalent 
cyano  group.  It  should  act  as  a  thinner,  but  may  be  too  polar  to  mix  well  with  hydrocarbon 
lubricants.  It  also  possesses  considerable  unsaturation  in  the  carbon/nitrogen  triple  bond. 

9.  Tetramethylenediamine.  This  molecule  is  a  moderate  sized  nitrogen  based  non-covalent 
chelator  for  positive  centers.  It  possesses  moderate  dipole  moments,  but  should  be  nicely  organic 
soluble  due  to  the  complete  methylation  of  the  amine  groups. 

10.  1,4-Dioxane.  A  small  but  somewhat  chair-shaped  ring  molecule  of  great  organic  solvent 
power,  having  two  oxygens  on  opposite  sides  of  the  ring.  The  molecule  possesses  considerable 
internal  dipole  moments  but  no  net  molecular  dipole  moment. 

11.  Dimethylacetamide.  A  small  molecule  which  can  slip  into  molecular  crevasses  easily.  This 
material  is  a  powerful  solvent,  containing  a  considerable  dipole  moment,  and  is  also  water 
miscible. 

12.  1,1,1-Trichloroethane.  A  highly  chlorinated  molecule  containing  both  large  internal  and 
external  dipole  moments. 

13.  Dibromoethane.  A  fairly  small  molecule  with  covalent  bromine  on  adjacent  carbons.  This 
material  will  have  dipole  moments,  but  will  be  less  polar  and  more  polarizable  than  the 
corresponding  chlorine  compound. 

14.  Chloroform.  This  is  a  small  molecule  with  a  large  dipole  moment,  which  will  enter  into  small 
crevasses  and  bind  well  with  metals  by  inducing  considerable  polarization. 

15.  Phenylacetylene.  A  fairly  small,  aromatic  ring  molecule  with  considerable  unsaturation  This 
material  could  be  a  very  good  source  of  "breakdown"  carbon  for  the  proposed  adamantane 
structure,  due  to  the  great  amount  of  unsaturation  and  the  great  mobility  of  the  small,  flat 
molecule.  The  molecule  is  highly  polarizable. 

Additive  Characteristics 

Generally  there  are  three  separate  individual  characteristics  which  report  to  have  beneficial 
effects  for  a  lubricant  additive,  with  respect  to  our  stated  thesis  of  the  possible  generation  of  hard 
carbon  at  the  cutting  tool  edge.  These  are  as  follows. 

1.  Molecular  Dipole  Moment.  This  characteristic  helps  define  the  extent  to  which  the  molecule 
will  generate  induced  dipole  moments  on  the  t&o^bit  surface.  This,  in  turn,  will  determine  the 
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extent  to  which  the  molecule  will  establish  dipole  bonding  and  "wet"  the  tool  bit  surface,  and 
cause  it  to  be  maximally  present  at  the  cutting  edge  of  the  machine  tool. 

2.  Molecular  Physical  Size/Shape.  The  molecular  size,  or  vander  Waal  radius,  determines  how 
well  the  molecule  is  able  to  pack  into  small  tool  bit  crevasses,  according  to  the  size  of  the 
crevasse  and  their  mutual  steric  interference.  The  effective  "size"  of  a  molecule  is  determined  by 
the  number  of  atoms  in  the  structure  and  the  size  of  the  atoms.  But  it  is  also  determined  by  the 
three  dimensional  shape  of  the  molecule  as  well.  A  flat  molecular  structure,  such  as  a  phenyl 
group,  will  occupy  less  volume  than  the  corresponding  saturated  structure. 

3.  Degree  of  Unsaturation.  The  degree  of  unsaturation  determines  how  much  the  starting 
molecule  is  similar  to  carbon  at  the  beginning.  This  simply  means  that,  molecule  for  molecule, 
there  are  fewer  interfering  atoms  to  deal  with  before  rearranging  the  molecule's  molecular 
structure  into  that  of  pure  carbon.  The  degree  of  unsaturation  of  a  molecule  possessing  a  phenyl 
group  is  three,  and  one  possessing  a  phenyl  group  plus  an  additional  caibon/carbon  double  bond 
(such  as  styrene)  is  four.  It  is  important  to  note  that  this  quick  analysis  does  not  address  the  type 
of  carbon  which  may  be  formed  -  that  is,  the  state  of  hybridization  of  the  carbon  atoms  or  the 
physical  structures. 

There  is  an  additional  characteristic  to  consider  concerning  unsaturation  -  that  of  the 
polymerizability  of  the  molecule.  A  molecule  such  as  styrene  is  strongly  polymerizable,  where  a 
very  similar  molecule,  phenyl  acetylene,  is  not  normally  polymerizable.  "Normally"  may  be  the 
key  operative  word,  here,  since  the  conditions  of  the  cutting  tool  edge  can  be  considered  not 
normal,  according  to  polymerization  technology,  and  the  acetylene  molecule  may  indeed  be  able 
to  form  a  complicated,  at  least  partially  unsaturated  polymer  chain  under  the  unusual  conditions. 
The  presence  of  a  carbon  backbone  polymer  at  the  cutting  edge  event  could  alter  the  physics  of 
molecular  surface  force  attachment,  and  the  subsequent  chemistry  done  at  the  cutting  edge.  Any 
effects  of  polymerizations  will  not  be  examined  specifically. 

The  lubricant  additives  were  chosen  in  an  effort  to  identify  the  types  of  molecular  structures 
which  will  be  most  efficacious  in  generating  diamond-like  or  other  very  hard  carbon  structures 
(1)  in  the  cutting  edge  event.  Simple  dipole  containing  molecules  were  compared  with  more 
complicated  molecules,  possessing  more  diversified  and  more  extensive  dipole  structures,  and  in 
some  cases  possessing  highly  polarizable  structures  themselves.  Dipole  generators  of  electron 
withdrawing  entities  from  simple  halogens  to  cyano  groups  were  compared,  with  carbon/carbon 
double  and  triple  bonds  acting  as  charge  dipole  extenders  in  some  cases. 

Spectrometric  Instrumentation 

Raman  measurements  required  the  drill  bit  samples  to  be  spectrally  clean,  and  necessitated 
cleaning  by  submersion  and  ultrasonic  agitation  in  tetrahydrofuran,  followed  by  spectrally  pure 
methanol.  This  was  followed  by  air  drying  and  storage  in  polyethylene  film. 

The  samples  were  taken  to  Instruments  S.A.  (SPEX)  in  Edison,  NJ.  and  under  the  supervision 
of  Howard  Schaffer  of  SPEX  they  were  examined  using  a  T64000  microprobe  equipped  with  a 
CCD  detection  system  and  the  SpectraMax  [tmf  spectroscopic  acquisition  and  analysis  software. 
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The  data  from  the  SpectraMax  was  output  into  a  generic  spreadsheet  format.  Specific  results 
are  shown  in  the  attached  Raman  graphs. 

IV.  RESULTS 


General 


Making  measurements  on  the  small  amounts  of  reaction  products  generated  by  a  drilling  event 
proved  to  be  very  difficult.  It  was  earlier  expected  that  in-situ  Auger  electron  spectroscopy  or 
even  electron  diffraction  of  laboratory  isolated  reaction  materials  could  be  used  to  identify  the 
carbon  products  of  the  drilling  procedure.  This  proved  impossible.  The  instrumentation  available 
to  the  university  at  that  time  consisted  of  surface  analysis  equipment  manufactured  by  Perkin- 
Elmer,  located  at  the  University  of  Texas  at  El  Paso  (UTEP)  and  the  NASA  WSTF  facility 
located  NE  of  Las  Cruces.  The  spatial  resolving  power  of  the  UTEP  instrumentation  was  not 
sufficient  to  do  analyses  on  drill  bits.  The  NASA  instrumentation  could  not  differentiate  between 
sp3  and  sp2  carbon  without  an  X-ray  attachment  which  would  have  cost  approximately  $75,000 
to  install.  Even  then  the  differentiation  would  require  far  more  carbonaceous  material  than  we 
could  possibly  make  available. 

Raman  Utilization 


If  one  is  trying  to  detect  the  presence  of  diamond-like  carbon  in  in  the  presence  of  graphitic  and 
other  more  complicated  thin  film  carbon  structures,  Raman  technology  may  be  the  technology 
of  choice.  The  ability  of  microRaman  (MR)  spectroscopy  to  perform  MR  measurements  on  a 
surface  spot  size  as  small  as  about  one  micron  is  extraordinarily  useful  for  surface  measurements 
dealing  with  the  very  small  areas  defined  by  machine  tool  cutting  edges.  This  applies  specifically 
to  the  drill  bit  cutting  edge. 

Since  Raman  spectroscopy  has  been  utilized  for  characterizing  hard  carbon  film  coatings  on 
computer  discs,  we  reasoned  that  it  would  work  for  our  experiments.  Further,  it  has  been  found 
that  spectral  characteristics  can  be  quantitatively  correlated  with  the  tribological  properties  of  the 
films  (2).  Therefore,  microRaman  spectroscopy  was  chosen  for  the  measurements. 

In  our  investigation,  we  focused  the  MR  laser  as  close  to  the  cutting  edge  of  the  drill  bit  as 
seemed  reasonable,  and  made  a  number  of  measurements  along  each  drill  bit  edge.  The  major 
area  of  interest  was  at  the  cutting  edge  at  about  3/4  of  the  radius  from  the  drill  bit  axis.  We  chose 
this  distance  because  it  was  the  distance  from  the  axis  where  maximum  heat  and  pressure  should 
be  generated  in  order  to  cause  disproportination  of  lubricant  molecules,  and  recombination  of 
elemental  carbon  into  the  forms  found  in-situ. 

Literature  Comparisons 

The  very  sharp  resonance  for  classical  diamond  in  Raman  spectroscopy  is  located  at  1332  cm-1. 
There  are  two  other  broad  peaks  that  may  be  present  in  the  Raman  spectra  of  carbon  films  in  the 
immediate  vicinity  of  the  diamond  peak.  These  have  been  defined  in  the  literature  as  the  mixed 
double  and  single  bonded  carbon  (or  sp2  and4sp8  hybridized)  band  and  the  graphitic  (or  sp2 
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hybridized)  bands,  containing  predominately  carbon/carbon  double  bonds  (1,5, 6, 7,8).  Ager  (1 1) 
discussed  the  mechanical  properties  of  the  G-band  frequency  and  the  tribochemical  wear  rate, 
the  abrasive  wear  rate  and  the  ratio  of  the  area  of  the  D-  and  G-bands.  The  graph.  Fig.  1,  is  a 
reproduction  of  that  discussion  and  the  graph.  Fig.  2,  is  a  typical  finding  from  the  MR  scans 
which  we  obtained  from  the  drill  bits. 


Fig.  1  Raman  spectra  of  an  a-C  film  Fig.  2  Raman  spectra  of  a  drill  bit  after 

(sputtered  carbon)  for  thin  film  drilling  with  cutting  oil  and 

disk  media,  Ager  (11)  1 ,2-Dichlorobenzene 


Nuclear  magnetic  resonance  (NMR)  spectroscopy  (5,6)  has  been  used  to  make  measurements 
on  carbon  films.  Both  carbon  13  magic  angle  spinning  (5)  and  static  single  pulse  excitation  of 
the  carbon  13  nucleus  (6)  have  measured  a  range  of  sp2  versus  sp3  hybridized  carbon  atoms  in 
sputtered  carbon  films,  although  in  one  examination  the  free  radical  content  of  the  films 
interfered  greatly  with  the  measurement  (6). 

A  large  free  radical  content  in  a  carbon  film  indicates  a  large  number  of  so  called  “dangling 
bonds”,  which  are,  per  force,  formed  at  the  surfaces  of  pure  carbon  structures  of  both  sp2  and  sp3 
hybridization.  If  a  specimen  of  carbon  film  possesses  a  large  number  of  such  free  radicals  (along 
with  both  sp2  and  sp3  hybridization),  then  it  is  reasonable  to  assume  that  it  is  composed  of  a  large 
number  of  small  individual  domains  of  those  sp2  and  sp3  carbon  atom  groups.  This  is  especially 
true  if  the  hydrogen  content  of  the  film  is  low  (6).  This  gives  further  credence  to  the  existence  of 
separate  graphitic  and  adamantane  carbon  domains  in  sputtered  carbon  films,  and  due  to  the 
similarity  of  Raman  spectra,  to  the  case  of  carbon  structures  generated  by  the  cutting  event  in 
machining.  Specific  measurements  utilizing  resonance  Raman  spectroscopy  has  determined  the 
presence  of  sp2  (pi  bonded)  carbon  structures  directly  (5). 

Appendix  B  contains  the  relevant  MR  scans  and  can  be  compare  to  Fig.s  1  &  2.  Only  those 
scans  that  showed  desired  results  are  included  due  to  the  large  number  of  plots  taken.  The 
Raman  spectral  scans  taken  from  our  used  drill  bits  were  striking  similar  to  those  taken  from 
sputtered  carbon  films  used  in  research  to  extend  J^e  wear  life  of  various  surfaces  (7,8,9, 11,12). 
Many  but  not  all  of  the  MR  measurements  resulted  in  broad  peak  sets  centered  at  about  1350 
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cm-1  (mixed  carbon)  and  about  1550  to  1600  cm-1  (graphitic  carbon).  In  all  cases  the  graphitic 
band  was  larger  than  the  mixed  band,  although  in  some  cases  they  were  nearly  the  same  size. 
The  mixed  carbon  peaks  were  about  150  cm-1  in  width  and  the  graphitic  peaks  about  120  cm-1 
in  width. 


V.  DISCUSSION 


General 


Small  carbon  particulates,  either  adamantane  or  graphitic  in  type,  should  respond  to  Raman 
measurements  according  to  the  ‘purity’  of  the  structures.  By  this  we  mean  that  if  the  interior  of  a 
carbon  particulate  is  of  a  pure  type,  then  the  Raman  measurement  will  respond  to  that  medium, 
but  with  a  signature  modified  by  the  Raman  active  (impure)  nature  of  the  surface  of  the  particle. 
Logically,  a  range  of  sizes  of  otherwise  identical  particles  might  result  in  a  band,  or  broad  peak 
Raman  signature,  due  to  the  varying  effects  forced  by  surface  to  volume  ratios.  Notably,  a  well 
defined  diamond  structure,  typified  by  a  gemstone  diamond,  results  in  a  sharp  peak, 
located  characteristically  at  1 332  cm- 1 . 

In  association  with  thin  deposited  carbon  films,  the  literature  is  replete  with  Raman  scans 
containing  two  broad  bands  located  at  approximately  1550  and  1350  cm-1,  which  have  been 
interpreted  as  respectively,  a)  evidence  for  graphitic  carbon  (with  a  great  deal  of  sp2  hybridized 
molecular  structures)  and,  b)  evidence  for  a  complicated  mixture  of  sp2  and  sp3  hybridized 
carbon  molecular  structures.  These  interpretations  are  less  than  clear  and  explicit,  especially  in 
the  case  of  the  lower  energy  peak  at  1350  cm-1.  However,  it  seems  evident  that  a  reasonable 
number  of  the  Raman  active  bonds  in  the  1350  cm-1  peaks  will  be  sp3  hybrids,  since  small 
fractions  of  such  bonding  would  simply  widen  the  higher  energy  graphite  peak  toward  lower 
energies. 

If  one  also  assumes  that  the  Raman  sensitivity  for  the  lower  energy  peak  molecule  set  is  roughly 
equivalent  to  that  of  the  higher,  then  one  can  logically  expect  the  peak  areas  to  roughly  reflect 
amounts  of  sp2  versus  sp3  carbon/carbon  bonding.  (Note  that  this  assumption  is  quite  different 
from  assuming  the  Raman  sensitivity  of  diamond  and  graphite  as  being  equal  -  the  literature 
measurements  indicating  that  graphite  is  some  50  times  more  sensitive  than  diamond  to  Raman 
excitation.)  Such  argumentation  immediately  leads  to  the  argument  that  the  1350  cm-1  band  may 
contain  a  reasonable  amount  of  sp2  hybridized  carbon(10),  although  this  conclusion  is  not 
unambiguously  determined  in  the  literature. 

If  it  is  further  assumed  that  these  ‘very  hard’  carbon  structures  do  not  contain  a  great  deal  of 
hydrogen(7)  (on  the  order  of  much  less  than  twice  the  stoichiometry),  then  the  carbon/carbon 
bonding  should  reasonably  contain  considerable  adamantane  or  adamantane  type  carbon 
structures,  and  the  presence  of  a  certain  amount  of  small  particulate  diamond  must  at  least  be 
considered.  This  would  not  necessarily  show  up  at  1332  cm-1  in  Raman  measurements,  due  to 
the  relatively  low  Raman  response  of  the  pure  carbon  adamantane  structure.  Such  a  supposition 
would  argue  that  the  broad  1350  cm-1  peak  was  a  collection  of  small  sp2  and  sp3  hybridized 
carbon  particulates  (either  covalently  connected  oi-0not)  possessing  a  relatively  large  surface  to 


8 


volume  ratio,  due  to  the  very  small  sizes,  at  least  some  of  which  must  have  surface  chemistries 
consisting  of  a  fairly  large  number  of  carbon/hydrogen  bonds. 


Mixed  Carbon  Structures 

The  above  supposition  does  not  exclude  the  possibility  that  there  also  exists,  in  this  complex  of 
carbon  structures,  particulates  consisting  predominately  of  graphitic  carbon,  or  at  least  carbon 
structures  which  contain  large  numbers  of  carbon/carbon  double  bonds.  Such  a  complex  would 
be  almost  completely  carbon  by  elemental  analysis,  but  might  contain  small  domains  of  graphitic 
carbon  interconnected  covalently  by  small  domains  of  adamantane  carbon,  forming  (at  last)  a 
greatly  extended,  mostly  carbon  three  dimensional  complex,  with  very  little  ability  for  mutual 
group  translational  motion.  Such  a  structure  might  well  test  as  a  very  hard  material,  since 
molecular  group  motion  to  any  extent  would  require  the  breaking  of  a  number  of  strong  covalent 
carbon/carbon  bonds  or  extensive  bending  of  molecular  structures.  These  structures  might  look 
very  different,  spectrometrically,  from  gemstone  diamond,  but  could  nevertheless  approach 
diamond  in  structural  hardness. 

It  should  be  observed  that  pure  graphite  (at  atmospheric  pressure)  is  a  soft  material,  and  is  used 
as  a  solid  lubricant.  But  under  vacuum  conditions  it  has  been  found  to  be  an  abrasive.  The 
anecdotal  reason  is  that  atmospheric  gas  molecules  act  as  "roller  bearings"  between  the  large, 
unconnected  graphite  sheets,  which  effectively  disappears  in  a  good  vacuum.  This  experimental 
observation  re-inforces  the  theoretical  argument  that  the  graphite  planes  are  extremely  rigid  in 
two  dimensions  and  still  quite  rigid  in  the  third  dimension,  due  to  plane-on-plane  re-inforcement. 
If  such  structural  planes  were  immobilized  by  extensive  carbon  cross-linking,  even  at 
atmospheric  pressure,  the  result  arguably  could  be  a  very  hard  and  abrasive  carbon  material, 
possessing  only  a  modicum  of  sp3  carbon  crosslinking  between  graphitic  sheets. 


VI.  CONCLUSION 

This  result,  and  the  fact  that  the  literature  accepts  the  1500  to  1600  cm-1  band  as  graphitic  in 
origin,  causes  us  to  propose  that  the  lower  energy  broad  peak  located  about  1350  cm-1  consists 
of ‘protographite’  and  ‘protodiamonds’  -  or  basically  adamantane  carbon  structures  of  relatively 
small  molecular  weight  compounded  with  similar  sized  graphitic  structures(lO),  each  type 
possessing  corresponding  large  (perhaps  C-H  containing)  surfaces  contributing  to  the  varying 
Raman  wavelengths  defining  the  band.  It  is  also  postulated  that  these  carbon/carbon  double 
bonded  structures  (graphitic)  and  the  carbon/carbon  single  bonded  structures  (adamantane)  may 
be  connected  by  the  following  possible  covalent  linkages:  a)  one  or  more  methylene  groups  in 
series,  b)  direct  bonding  between  an  apical  or  equitorial  adamantane  carbon  and  a  graphitic 
double  bonded  carbon,  or  c)  any  of  a  number  of  possible  partially  saturated  carbocycles  or  linear 
arrangements  bonded  covalently  to  individual  graphitic  or  adamantane  domains,  some  of  which 
may  originate  containing  ‘dangling  bonds’. 

We  believe  that  the  absence  of  a  strong  1332  cm-1  Raman  peak  eliminates  the  possibility  of 
adamantane  structures  of  large  extent,  relative4  to1  ktomic  or  small  molecule  sizes.  Due  to  the 
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large  size  of  some  of  the  1350  cm-1  peaks,  we  believe  that  there  is  at  least  the  possibility  that  a 
very  hard  form  of  relatively  pure  carbon  is  being  formed  by  the  heat  and  pressure  of  the  cutting 
operation,  and  that  this  relatively  pure  carbon  may  consist  of  very  small  graphitic  and 
adamantane  domains,  connected  covalently  by  sp3  or  mixed  sp3/sp2  hybrid  carbon  structures.  The 
available  literature  dealing  with  sputtered  carbon  films  states  that  such  carbon  structures  have 
been  measured  to  be  very  hard  -  in  the  range  of  20%  of  that  of  diamond  (9) 

Follow-on  research  should  focus  on  a  method  to  generate  a  substantial  amount  of  the 
carbonaceous  material  identified  by  spectroscopy  as  being  possibly  ‘hard  carbon’.  This  material 
can  then  be  subjected  to  hardness  tests  to  finally  determine  whether  damond-like,  hard  carbon  is 
generated  at  the  cutting  edge  of  a  tool  steel  drilling  event. 
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APPENDIX  A. 


The  force  plots  for  the  drilling  operation  corresponding  to  the  MR  scans  found  in  Appendix  B. 
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APPENDIX  B. 


Contents: 

Listing  of  all  doping  agents  tested. 

Listing  of  doping  agents  yielding  positive  results. 

The  Raman  spectra  scans  from  1 100  -  1700  cm'1  for  lubricated  drill  bits  that  showed  “hard 
carbon”  patterns. 

The  Raman  spectra  scans  from  840  -  2000  cm'1  for  lubricated  drill  bits  that  showed  “hard 
carbon”  patterns. 
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APPENDIX  B 

All  materials  tested 
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HYDROCARBON  FRACTION 
HYDROCARBON  FRACTION 


HYDROCARBON  FRACTION 
HYDROCARBON  FRACTION 
HYDROCARBON  FRACTION 


HYDROCARBON  FRACTION 
HYDROCARBON  FRACTION 


HYDROCARBON  FRACTION 
HYDROCARBON  FRACTION 
HYDROCARBON  FRACTION 


ANMS  AA01  HYDROCARBON  FRACTION 
ANMS  AA02  HYDROCARBON  FRACTION 


HYDROCARBON  FRACTION 
HYDROCARBON  FRACTION 


ANMS  AD01  HYDROCARBON  FRACTION 


ANMS  AD02  HYDROCARBON  FRACTION 
ANMS  AD03  HYDROCARBON  FRACTION 


ANMS  AE01  HYDROCARBON  FRACTION 

ANMS  B101  LIGHT  MACHINE  OIL 


ANMS 

A001 

ANMS 

A002 

ANMS 

A101 

ANMS 

A201 

ANMS 

A301 

ANMS 

A401 

ANMS 

A402 

ANMS 

A501 

ANMS 

A601 

ANMS 

A701 

ANMS 

A801 

ANMS 

A901 

ANMS 

AA01 

ANMS 

AA02 

ANMS 

AB01 

ANMS 

AC01 

DOPING  AGENT 


1 ,4-DIOXANE _ 

1 ,4-DIOXANE 


STYRENE _ _ 

IODOMETHANE 


2,4-PENTANEDIONE _ 

PROPARGYL  CHLORIDE _ 

PROPARGYL  CHLORIDE 


3-METHYL-2-BUTANONE _ 

1,2-DICHLOROBENZENE 


SULFOLANE _ _ 

ACETONITRITE _ 

TETRAM  ETH  YLETH  YLENED I  AM  I N  E 


DIMETHYL  ACETAMIDE _ 

DIMETHYL  ACETAMIDE 


1,1,1-TRICHLOROETHANE _ 

1 ,2-DIBROMOETHANE 


CHLOROFORM 


CHLOROFORM _ 

CHLOROFORM  ’  ^ 


PHENYL  ACETYLENE _ 

STYRENE 


SHOWED 


NOTHING 


ANMS 

B201 

LIGHT  MACHINE  OIL 

IODOMETHANE 

ANMS 

ii301 

LIGHT  MACHINE  OIL 

2,4-PENTANEDIONE 

ANMS 

B601 

LIGHT  MACHINE  OIL 

1 ,2-DICHLOROBENZENE 

LIGHT  MACHINE  OIL 

SULFOLANE 

★ 

LIGHT  MACHINE  OIL 

ACETONITRITE 

* 

ANMS 

BA01 

LIGHT  MACHINE  OIL 

DIMETHYL  ACETAMIDE 

* 

ANMS 

BD01 

LIGHT  MACHINE  OIL 

CHLOROFORM 

* 

ANMS 

C101 

30  W  MOTOR  OIL 

STYRENE 

* 

ANMS 

C401 

30  W  MOTOR  OIL 

PROPARGYL  CHLORIDE 

★ 

ANMS 

C601 

30  W  MOTOR  OIL 

1 ,2-DICHLOROBENZENE 

* 

ANMS  D003 


ANMS  D101 
ANMS  D201 


S 

D2 

S 

D3 

30  W  MOTOR  OIL 

DIMETHYL  ACETAMIDE 

* 

30  W  MOTOR  OIL 

CHLOROFORM 

* 

CUTTING  OIL  W/SULFUR 

1 ,4-DIOXANE 

* 

CUTTING  OIL  W/SULFUR 

1,4-DIOXANE 

* 

CUTTING  OIL  W/SULFUR 

1 ,4-DIOXANE 

* 

CUTTING  OIL  W/SULFUR 

STYRENE 

* 

CUTTING  OIL  W/SULFUR 

IODOMETHANE 

* 

CUTTING  OIL  W/SULFUR 

IODOMETHANE 

* 

CUTTING  OIL  W/SULFUR 

2,4-PENTANEDIONE 

* 
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ANMS 

D302 

CUTTING  OIL  W/SULFUR 

2,4-PENTANEDIONE 

* 

ANMS 

D401 

CUTTING  OIL  W/SULFUR 

PROPARGYL  CHLORIDE 

* 

ANMS 

D501 

CUTTING  OIL  W/SULFUR 

3-METHYL-2-BUTANONE 

* 

ANMS 

D601 

CUTTING  OIL  W/SULFUR 

1 ,2-DICHLOROBENZENE 

ANMS 

D603 

CUTTING  OIL  W/SULFUR 

1,2-DICHLOROBENZENE 

ANMS 

D701 

CUTTING  OIL  W/SULFUR 

SULFOLANE 

* 

ANMS 

D703 

CUTTING  OIL  W/SULFUR 

SULFOLANE 

* 

ANMS 

D801 

CUTTING  OIL  W/SULFUR 

ACETONITRITE 

* 

ANMS 

D901 

CUTTING  OIL  W/SULFUR 

TETRAMETHYLETHYLENEDIAMINE 

* 

ANMS 

DA01 

CUTTING  OIL  W/SULFUR 

DIMETHYL  ACETAMIDE 

* 

ANMS 

DAT2 

CUTTING  OIL  W/SULFUR 

DIMETHYL  ACETAMIDE 

* 

ANMS 

DAT3 

CUTTING  OIL  W/SULFUR 

DIMETHYL  ACETAMIDE 

* 

ANMS 

DATU 

CUTTING  OIL  W/SULFUR 

DIMETHYL  ACETAMIDE 

* 

ANMS 

DBOI 

CUTTING  OIL  W/SULFUR 

1 ,1 ,1-TRICHLOROETHANE 

* 

ANMS 

DBPU 

CUTTING  OIL  W/SULFUR 

1,1,1-TRICHLOROETHANE 

* 

ANMS 

DC01 

CUTTING  OIL  W/SULFUR 

1 ,2-DIBROMOETHANE 

* 

ANMS 

DC02 

CUTTING  OIL  W/SULFUR 

1 ,2-DIBROMOETHANE 

* 

ANMS 

DD01 

CUTTING  OIL  W/SULFUR 

CHLOROFORM 

ANMS 

DD02 

CUTTING  OIL  W/SULFUR 

CHLOROFORM 

* 

ANMS 

DD03 

CUTTING  OIL  W/SULFUR 

CHLOROFORM 

★ 

ANMS 

DE01 

CUTTING  OIL  W/SULFUR 

PHENYL  ACETYLENE 

- * - 

XNMS 

AR01 

CALIBRATION 
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SAMPLE 

BASE 

DOPING  AGENT 

ANMS 

A402 

HYDROCARBON  FRACTION 

PROPARGYL  CHLORIDE 

ANMS 

A801 

HYDROCARBON  FRACTION 

ACETONITRITE 

ANMS 

AA02 

HYDROCARBON  FRACTION 

DIMETHYL  ACETAMIDE 

HYDROCARBON  FRACTION 

CHLOROFORM 

Ql 

HYDROCARBON  FRACTION 

PHENYL  ACETYLENE 

ANMS 

B201 

LIGHT  MACHINE  OIL 

IODOMETHANE 

ANMS 

B301 

LIGHT  MACHINE  OIL 

2,4-PENTANEDIONE 

ANMS 

D601 

CUTTING  OIL  W/SULFUR 

1 ,2-DICHLOROBENZENE 

ANMS 

D603 

CUTTING  OIL  W/SULFUR 

1 ,2-DICHLOROBENZENE 

ANMS 

DD01 

CUTTING  OIL  W/SULFUR 

CHLOROFORM  ; 
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A  Performance  Study  of  Plasma  Source  Ion-Implanted  Tools 
Versus  High-Speed  Steel  Tools 

Brian  K.  Lambert  and  Roger  H.  Woods 
New  Mexico  State  University 
Las  Cruces,  NM  88003 

Abstract 


The  performance  of  plasma  source  ion-implanted  high-speed  steel  tools  were  compared 
with  untreated  high-speed  steel  tools  in  a  test  modeled  after  actual  manufacturing  usage. 
The  variables  of  interest  were  tool  wear  and  the  resulting  surface  finish  of  the  work  piece. 
A  set  of  12  M2  high-speed  steel  turning  tools  were  implanted  with  N+,  resulting  in  a 
retained  dosage  of  2.5  x  1017  N-at/cm2  using  plasma  source  ion  implantation.  The  12  tools 
where  compared  to  12  unimplanted  tools  using  a  simple  turning  operation  on  4140  steel. 
The  implanted  tools  exhibited  an  increase  in  wear  resistance  by  a  factor  of  1.5  over  the 
unimplanted  tools,  and  implantation  exhibited  no  effect  on  the  resulting  surface  finish. 

I.  Introduction 

Tool  wear  and  tool  failure  are  areas  researched  in  great  detail  to  improve  the  manufacturing 
process.  New  coatings  and  materials  have  been  developed  to  lengthen  the  life  of  metal 
cutting  tools.  Unfortunately,  these  processes  can  produce  harmful  by-products  and 
adversely  affect  the  environment.  Ion  implantation  has  been  shown  to  improve  the  wear 
characteristics  of  metals  without  producing  environmentally  harmful  by-products. 

Throughout  the  machining  process,  the  dimensions  of  a  tool  change  because  of  wear  to  its 
contact  surfaces.  Manufacturing  engineers  want  to  predict  that  wear  to  hold  tolerances  and 
surface  finishes  throughout  the  process.  Being  able  to  predict  the  wear  of  the  tool  also 
allows  the  engineer  to  determine  the  life  of  the  tool,  thereby  avoiding  a  catastrophic  tool 
failure  and  possible  damage  to  the  part,  the  operator,  and  machinery.  The  engineer  can 
effectively  model  this  wear  by  understanding  the  variables  that  determine  tool  life  and  wear 
rate:  tool  material,  tool  geometry,  machining  variables,  and  work  piece  variables. 

The  tool  material  used  is  determined  by  the  material  to  be  machined  and  the  process  under 
which  the  material  goes.  The  development  of  new,  stronger,  harder,  tougher  tool  materials 
has  been  an  area  of  great  importance  in  manufacturing.  The  tool  material  must  have  certain 
traits  and  characteristics  to  resist  wear  and  tool  failure. 
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Typically,  an  increase  in  the  hardness  of  the  tool  material  decreases  its  fracture  toughness. 
However,  the  hardness  increases,  the  material  may  be  too  hard  to  properly  form  into  the 
necessary  tool  shape.  Coatings  have  been  developed  to  increase  wear  resistance  but  may 
not  remain  adhered  to  the  surface  of  the  tool  and  can  produce  harmful  by-products.  The 
engineer  continuously  balances  the  speeds,  feeds,  and  tool  material  to  achieve  the  lowest 
cost  possible.  Speeds  and  feeds  must  be  kept  low  enough  to  provide  for  a  minimum 
acceptable  tool  life;  otherwise,  the  cost  of  changing  the  tool  may  outweigh  the  productivity 
gains  in  increased  cutting  speed.  The  advent  of  ion  implantation  technology  may  have 
solved  some  of  these  materials  problems. 

A.  Tool  Wear 

Abrasive  wear  is  the  primary  wear  mechanism  during  steady  state  wear  at  low  temperatures 
and  is  measured  by  the  flank  wear  of  the  tool.  Plastic  deformation  occurs  at  high 
temperature  and  high  cutting  pressures.  The  tool  deforms  under  the  pressure  and 
temperature,  rounding  the  cutting  edge  of  the  tool.  Chemical  decomposition  weakens  the 
bonds  between  minute  tool  segments,  resulting  in  the  particles  being  pulled  off  the  tool.  As 
the  work-hardened  chip  passes  by  the  tool,  the  asperites  in  the  tool  surface  may  be  welded 
to  the  chip  and  effectively  pulled  off.  This  wear  occurs  on  the  top  of  the  tool,  resulting  in 
crater  wear. 

Tool  wear  is  measured  by  the  area  of  flank  wear  or  the  depth  of  crater  wear  on  a  tool.  Flank 
wear  occurs  on  the  front  surface  or  flank  where  the  tool  contacts  the  work  piece  material. 
Crater  wear  occurs  on  the  top  of  the  tool  where  the  work  hardened  chip  contacts  the  tool 
surface.  Tool  failure  occurs  when  the  tool  can  no  longer  produce  parts  to  specifications 
because  of  the  wear  to  the  tool.1 

Tool  life  is  determined  by  a  set  amount  of  flank  wear  that  occurs  before  the  tool  is  unable  to 
hold  tolerance  or  could  experience  catastrophic  tool  failure.  The  generally  accepted  value 
for  maximum  wear  land  is  0.025  to  0.030  inch.1>2  Wear  land  is  determined  by  measuring 
the  height  of  tool  wear  (Wf  or  Fw)  along  the  contact  length  (1  or  x)  as  illustrated  in  Fig.  1 
from  DeGarmo2. 


Chtpc  in 


Figure  1.  Tool  wear  (DeGarmo2) 
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B.  Surface  Finish 

Attaining  the  proper  surface  quality  is  an  important  part  of  producing  quality  parts.  The 
close  tolerances  required  in  today’s  complex  systems  require  that  the  finishes  be  achieved 
according  to  the  specifications  each  and  every  time.  Surface  roughness  changes  as  the 
machining  process  progresses  because  of  the  wear  that  occurs  on  the  flank  of  the  tool, 
changing  its  dimensions.  This  tool  wear  can  progress  to  the  point  where  the  tool  is  not 
producing  parts  with  the  necessary  finish  or  to  the  necessary  tolerances. 

C.  Ion  Implantation 

Ion  implantation  modifies  surface  sensitive  properties  of  different  materials.  Ion 
implantation  increases  the  wear  resistance,  corrosion  resistance,  fatigue  resistance  and 
hardness  of  many  materials.  Implantation  is  used  both  to  harden  cutting  tools  and  as  a 
surface  finish  on  parts  that  are  subjected  to  heavy  wear  or  extreme  conditions. 

Scientists  discovered  that  the  ability  to  introduce  any  ion  into  the  surface  region  of  a 
substrate  without  the  constraints  of  thermodynamic  equilibrium  could  have  distinct 
advantages  in  metal  processing.  Using  ion  implantation,  a  certain  impurity  can  be 
introduced  into  the  surface  layer  in  a  very  pure,  clean,  controllable,  and  reproducible 
way.3-4  The  advent  of  ion  implantation  as  a  surface  treatment  has  now  made  possible  the 
formation  of  compounds  outside  the  usual  restraints  of  a  metal’s  thermodynamic 
equilibrium. 

Unlike  coating  techniques,  ion  implantation  changes  the  molecular  and  chemical  structure 
of  the  surface  without  adding  any  volume;  therefore,  dimensional  tolerances  are 
maintained.5’6  Normally,  the  real  weakness  in  deposition  or  coating  techniques  lies  in  the 
adhesion  of  a  binder  or  glue.  The  surface  layer  in  ion  implantation  becomes  an  integral  part 
of  the  material  without  the  discontinuities  or  adhesion  problems  found  in  other 
processes.4*7 

The  temperature  of  the  target  can  be  controlled  during  implantation  and,  therefore, 
introduce  impurities  during  an  earlier  metal  stage  in  crystal  growth.  The  ability  to  control 
the  temperature  allows  the  treatment  to  be  done  at  room  temperatures,  and  the  bulk 
properties  of  the  metal  remain  unchanged.8  Parts  and  tools  can  be  produced  using  cheaper 
base  metals  that  are  easily  formed  and  then  implanted  to  produce  the  desired  increases  in 
hardness,  corrosion  resistance,  etc. 

Until  recently,  the  majority  of  the  research  and  development  outside  the  semi-conductor 
industry  had  been  in  academia  rather  than  in  industry .*  The  emphasis  in  implantation 
research  has  been  on  optimizing  the  process  and  determining  the  proper  implantation 
voltages,  doses,  and  dopants.  The  results  of  these  tests  have  been  investigated  using  pin- 
on-disk  tests  and  other  laboratory-based  experiments.  The  maturation  of  this  technology 
has  now  allowed  scientists  to  investigate  the  practical  applications  of  ion  implantation. 

Ion  implantation  is  energy  deposition  in  a  target  material.  Ions  are  produced  and  then 
accelerated  towards  a  target  material  where  their  interactions  produce  the  resulting  chemical 
and  physical  property  changes.  Ions  penetrate  the  work  piece  because  of  their  high  kinetic 
energy  that  is  gained  by  accelerating  them  in  an  magnetic  field.10  Through  elastic  collision, 
the  ions  displace  the  target  atoms  from  their  lattice  positions,  creating  a  distorted  lattice 
structure.11  The  ion’s  energy  carries  through  the  voids  in  the  lattice,  creating  distortions  as 
it  collides  with  atoms  until  it  comes  to  rest  in  a  vacancy  in  the  lattice  structure. 
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Plasma  source  ion  implantation  was  developed  to  implant  irregularly  shaped  surfaces  with 
an  equal  dose  of  ions.  The  target  is  placed  direcdy  in  the  plasma  source  (usually  nitrogen) 
and  pulsed  to  high  negative  potential  relative  to  the  chamber  walls.  The  ions  are  accelerated 
normal  to  the  target  surface,  across  the  plasma  sheath,  impacting  the  target  normal  to  the 
surface.7-8’10-12 


Plasma  Source  Ion  Implantation 

•  Plasma  Sheath  Surrounds  target 

•  Jons  bambard  an  surfaces  of  target  without 
beam  rastering  or  target  manipulation 


In  PSII,  the  ion  "beamlets"  bombard 
the  target  at  nearly  normal  incidence, 
thereby  eliminating  the  need  for 
target  masking. 


Figure  2.  Plasma  Source  Ion  Implantation  (Conrad12) 

As  Fig.  2  illustrates,  no  masking  or  beam  manipulation  is  required  with  this  method. 
Plasma  source  ion  implantation  provides  a  uniform  dose  to  all  of  the  target  regardless  of  its 
shape.12 

Ion  implantation  has  increased  the  hardness,  corrosion  resistance,  fatigue  resistance,  and 
wear  resistance.  These  four  areas  are  of  prime  concern  to  any  designer,  mechanical 
engineer,  or  metallurgist  in  the  design  of  a  tool  or  component.  Certain  ions  implanted  in  the 
heavily  used  metals  such  as  steel,  aluminum,  and  iron  can  modify  the  desired  characteristic 
for  that  specific  usage.  Ion  implantation  increased  some  of  these  characteristics  from  a 
100%  increase  to  a  10-fold  increase  in  some  cases.7’10’11 

Ion  implantation  increases  the  surface  hardness  of  the  target  by  combining  three  different 
reactions  between  the  ions  and  the  target’s  microstructure.  First,  the  action  of  the  ion 
slamming  into  the  lattice  structure  acts  as  a  microscopic  peening  operation.  The  ions  distort 
the  lattice  structure  and  cause  strain  hardening  to  occur  in  the  case  of  the  target.  Secondly, 
the  ions  travel  down  the  various  voids  and  inclusions  to  finally  wedge  in  the  voids.  This 
action  fills  the  microcracks  and  boundaries  between  the  grains,  increasing  the  strength  of 
the  inter-granular  bonds.  The  third  reaction  is  a  chemical  reaction  that  can  occur  between 
the  target's  base  metal  and  the  implanted  ions.  These  reactions  produce  intermetallic 
compounds  such  as  the  formation  of  iron  nitrides  in  steel  and  iron,  producing  a  harder  alloy 
on  the  case  of  the  target.5*8’13’14 


These  increases  in  hardening  are  usually  done  by  implanting  titanium,  nitrogen,  boron, 
argon,  and  molybdenum.  Studies  have  shown  varying  success  with  almost  ail  elements  in 
the  periodic  table.15  Nitrogen  is  used  with  almost  all  metals  and  is  used  the  most  often  in 
ion  implantation  metallurgy.  Studies  have  shown  hardness  increases  of  100%  to  200%  in 
the  cases  of  nitride-implanted  steels.7*10’13 

The  increased  hardness  corresponds  to  a  direct  increase  in  the  wear  resistance  of  implanted 
samples.  In  one  case,  the  wear  rates  were  reduced  by  up  to  a  factor  of  30,  others  decreased 
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by  1,4,  and  10  orders  of  magnitude.14’16  The  increased  hardness  is  not  thought  to  be  the 
only  reason  that  the  wear  resistance  can  be  improved  dramatically.  In  some  cases,  ion 
implantation  lowers  the  coefficient  of  friction  by  a  factor  of  2,  which  has  a  direct  influence 
on  the  wear  rate.4’7’13 

Ion  implantation  also  decreases  wear  by  encouraging  the  growth  of  oxide  films  that  occur 
between  the  two  sliding  surfaces.  These  films  naturally  occur  on  some  steels,  but  the 
implantation  process  develops  thicker  films.  This  film  formation  is  attributed  to  the 
chemical  changes  brought  on  by  the  implantation  and  the  formation  of  nitrides  in  the 
surface  layer  when  implanting  with  nitrogen.8-14  Oxide  films  act  as  a  solid  lubricant  in  the 
wear  mechanism  and  drastically  slow  down  the  wear  process. 


II.  Recent  Research 

As  the  amount  of  research  into  ion  implantation  metallurgy  increased,  the  need  to  start 
applying  this  basic  research  to  practical  applications  became  apparent.  The  vast  amount  of 
basic  research  was  starting  to  be  applied  with  the  help  of  the  U.S.  Government  and  the 
Army  Research  Office.  With  the  cooperation  of  the  Corpus  Christi  Army  Depot  (CCAD) 
and  the  Rock  Island  Arsenal,  implanted  tooling  began  to  be  researched  in  the  metal 
manufacturing  environment.  Hughes  Research  Laboratory  expanded  their  plasma  research 
into  the  implantation  of  production  tooling  used  on  the  shop  floor.  All  three  entities  have 
found  markedly  improved  increase  in  tool  life  with  ion  implantation. 

The  Rock  Island  Arsenal  worked  with  Dr.  John  Conrad  at  the  University  of  Wisconsin- 
Madison  to  enhance  the  wear  lifetimes  of  uncoated  and  TiN-coated  HSS  drills.  The  coated 
drills  were  implanted  with  carbon;  uncoated  drills  were  implanted  using  an  acetylene 
plasma  that  produced  a  high  hardness-low  friction  diamond-like  carbon  (DLC)  coating.  The 
TiN-coated  and  implanted  tools  showed  an  increase  in  tool  life  and  wear  resistance,  but  not 
as  large  an  improvement  as  CCAD’s  or  Hughes’s  tooling.  The  uncoated  tools  did  not  show 
a  statistically  significant  improvement.  Since  then,  the  application  of  DLC  coating  has  been 
improved  upon,  and  further  research  is  under  way  to  understand  its  effect  on  wear 
resistance6 

CCAD  compared  the  performance  of  nitrogen-implanted  tools  in  a  controlled  but  realistic 
production  environment.  They  implanted  uncoated  M4  HSS  taps,  a  TiN-coated  tungsten 
carbide  cutting  insert,  and  two  uncoated  D2  steel  alloy  punch  die  sets.  After  implantation 
the  tools  were  marked  and  mixed  in  with  unimplanted  tooling,  the  operators  did  not  know 
of  the  difference.  The  performance  criterion  was  the  tool’s  ability  to  make  parts  within 
specs,  the  parts  made  until  failure.  The  taps  exhibited  a  relative  performance  increase  of  1.8 
to  4.4;  the  inserts  a  3.8  increase;  and  the  dies,  a  1.9  increase.17 

Hughes  Research  Laboratories  has  used  plasma  ion  implantation  to  achieve  a  2  to  3X 
improvement  in  wear  life  of  cobalt-cemented,  tungsten  carbide  drill  bits,  and  an  8X  reduced 
wear  rate  for  TiN-coated  coated  cutting  tools.  Hughes  wanted  to  validate  plasma  ion 
implantation  technology  in  actual  field/factory  evaluations.  Hughes  used  die  same  blind  test 
that  both  Rock  Island  and  CCAD  used  to  test  unimplanted  tools  versus  implanted  tooling. 
The  experiment  was  identical  to  CCAD’s  to  provide  a  comparison  of  ion-beam  implantation 
used  by  CCAD  with  plasma  implantation  used  at  Hughes. 30 

These  tests  were  the  first  to  be  done  with  implanted  tooling  in  a  realistic  manufacturing 
environment.  They  have  shown  increases  in  tool  life  and  wear  resistance.  The  small  sample 
size  (three  tools  in  Rock  Island’s  study)  |n<|jincontrolled  shop  floor  environments  show  a 
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need  for  further  scientific  study  into  implantation  applications.  Each  study  stated  a  need  for 
further  investigation  into  various  tool  types,  operations,  and  implantation  variables.18 


III.  The  Experiment 
A.  Experimental  Design 

The  experiment  had  three  independent  variables  of  interest:  tool  type,  speed,  and  feed.  The 
experiment  used  two  tool  types,  three  speeds,  and  three  feeds.  A  2  x  3  x  3  factorial  design 
was  used  to  determine  the  interactions  and  perform  accurate  statistical  analysis.  The  2  x  3  x 
factorial  design  does  not  allow  the  use  of  a  center  point  to  measure  the  accuracy  of  the 
experiment,  so  three  additional  trails  at  the  middle  speeds  and  feeds  with  each  tool  type 
were  used.  This  procedure  provided  an  estimation  of  the  error  in  the  experiment  without 
having  to  run  two  or  more  tests  for  every  level.19  The  24  trials  were  randomized  using 
JMP  statistical  software  by  SAS. 

The  experiment  involved  identical  machining  parameters  for  the  two  types  of  tooling.  The 
purpose  is  to  machine  the  work  piece  long  enough  to  obtain  significant  wear  on  the  tool  to 
measure  the  possible  decrease  in  performance  associated  with  the  tool  wear.  The 
experiment  used  three  different  cutting  speeds  to  allow  the  analysis  of  different  wear  rates 
versus  time.  The  recommended  machining  parameters  for  HSS  M2  grade  tools  on  4140 
steel  are  DOC  =  .04  in..  Speed  =  135  fpm,  Feed  =  .0070  ipr.20 

Using  this  information  as  a  baseline,  the  speeds  and  feeds  were  varied  while  keeping  the 
depth  of  cut  constant  at  .05  inches.  Three  speeds  and  three  feeds  were  used  resulting  in 
nine  combinations  for  each  type  of  tool,  plus  three  extra  trials  for  each  tool  at  the  middle 
speed  and  feed.  The  RPM  settings  on  the  lathe  were  the  determining  factor  in  the  choice  of 
speeds.  The  feeds  were  selected  by  finding  the  closest  feed  possible  in  the  lathe  to  the  target 
feeds  of  0.0070, 0.0085,  and  0.010.  The  gearing  mechanisms  that  determine  the  feed  rates 
allow  0.0069,  0.0084,  and  0.0099  respectively. 

Speeds  Level  Feeds 

1 25  fpm  ( 1 60  RPM)  - 1  0.0069  ipr 

157  fpm  (200  RPM)  0  0.0084  ipr 

196  fpm  (250  RPM)  1  0.0099  ipr 

The  speeds  used  are  higher  than  those  that  are  recommended  so  that  the  tools  would 
experience  accelerated  tool  wear.  At  lower  speeds,  the  amount  of  work  piece  material  and 
time  required  would  be  unreasonable. 

Each  tool  was  used  for  20  minutes  of  machining,  which  allowed  for  substantial  tool  wear 
but  was  not  long  enough  to  induce  tool  failure.  The  wear  was  checked  every  5  minutes  to 
predict  a  wear  rate  of  each  tool  at  the  various  levels. 

The  experiment  was  performed  at  the  New  Mexico  State  University’s  Advanced 
Manufacturing  Laboratory.  A  Nardini  engine  lathe  was  used  to  turn  the  work  pieces.  The 
lathe  is  equipped  with  a  four-jaw  chuck  and  a  live  center  mounted  on  the  tailstock.  The 
work  piece  was  stabilized  with  a  traveling  rest  and  a  steady  rest. 
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B.  Tools 


Twenty  four  heat  treated,  M2,  right-hand  turning,  high-speed  steel  tools  with  a  .075  in. 
nose  radius,  15°  end-relief  angle,  and  44°  side-relief  angle  were  used.  The  composition  of 
M2  tool  steel  is  as  follows: 


Carbon  1% 

Tungsten  6% 

Molybdenum  5% 

Chromium  4% 

Vanadium  2% 21 


High-speed  steel  tools  were  chosen  because  research  has  shown  significant  increases  in 
tool  life  in  ion-implanted  high-speed  steel  tools.  High-speed  steel  tools  also  provide  a  fast 
enough  wear  rate  to  be  measurable  within  a  reasonable  amount  of  time  and  using  a 
reasonable  amount  of  work  piece  material.  Twelve  of  the  tool  bits  were  sent  to  Los  Alamos 
National  Laboratory  to  be  implanted  with  nitrogen  at  the  laboratory’s  plasma  source  ion 
implantation  facility.  They  were  implanted  with  nitrogen  with  retained  a  dose  of  2.5xl017 
N+  at/cm2- 


C.  Material 

Three  20  ft.  lengths  of  3-in.  diameter,  4140  steel,  hot  rolled,  bar  stock,  Brinell  Hardness  - 
18721,  was  be  cut  into  work  pieces  36  in.  long.  A  steady  rest  and  traveling  rest  was  used  to 
eliminate  chatter  and  improve  dimensional  stability.  The  length  of  36  inches  was  chosen 
because  that  is  the  maximum  length  that  the  lathe  can  fit  when  using  a  tailstock  and  with  the 
use  of  the  steady  rest  and  traveling  rest  provides  24  inches  of  machinable  material.  4140 
steel  is  a  common  alloy  that  is  moderately  difficult  to  machine  and  can  provide  significant 
tool  wear. 


The  composition  of  4140  steel  is  as  follows: 

Carbon 

Mn 

P  max 

S  max 

Silicon 

Chromium 

Molybdenum 


.38  to  .43% 
.75  to  1.00% 
.035% 

.040% 

.2  to  .35% 

.8  to  1.1% 

.15  to  .25%  21 


D.  Data  Collection 

Data  were  collected  to  determine  wear  rates  and  surface  finish.  After  each  5-min.  machining 
run,  the  tool  was  measured  for  the  resultant  wear  land.  The  measurements  were  done  on  an 
Opticom  optical  magnifier  with  a  100X  magnification  power  and  a  measurement  accuracy 
of  0.0005  inches. 

Each  5-min.  tool  run  was  marked  on  the  bar.  When  the  machinable  material  on  the  bar 
became  less  than  that  requited  for  the  next  trial,  the  bar  was  taken  off  the  lathe  and  five 
surface  roughness  measurements  were  taken  on  each  5-min.  run.  The  bar  was  partially 
rotated  after  each  measurement  to  preclude  any  skew  in  the  data.  The  surface  roughness  of 
the  resulting  work  pieces  was  measured  by  a  SURFCOM  120A  surface  texture  measuring 
instrument  manufactured  by  Tokyo  Seimitsu  Co.  The  Ra  (Arithmetic  Average),  the  mean 
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surface  roughness,  was  the  variable  of  interest.  The  profilometer  measured  a  length  of 
.2953  inches  each,  with  a  magnification  of  5000X. 

IV.  Results 

The  experiment  produced  four  data  points  per  trial  for  surface  finish  and  tool  wear  at  5- 
min.  intervals  in  each  test.  The  data  points  were  averaged  to  produce  a  mean  wear  for  each 
test  and  divided  by  the  time  to  produce  a  wear  rate.  The  results  are  given  in  Table  I. 

Table  I:  Data  for  Surface  Finish  and  Tool  Wear 


Un-Implanted  Implanted 


Speed  Feed 

Mean 

Wear 

xlO*2 

Wear 

Rate 

xlO-5 

in/min 

Surface 

Finish 

Mean 

Wear 

xlO-2 

Wear 

Rate 

xlO-5 

in/min 

Surface 

Finish 

-1 

-1 

4.56250 

9.25 

102.05 

0.38000 

1.25 

140.59 

-1 

0 

1.08500 

7.00 

171.85 

0.91625 

11.50 

184.08 

0 

-1 

1.00875 

32.00 

173.54 

0.94500 

11.25 

143.83 

1 

1 

1.47250 

25.50 

237.75 

2.01125 

5.75 

319.57 

0 

1 

5.20625 

108.50 

253.74 

0.73875 

9.25 

142.72 

1 

-1 

0.76125 

9.50 

100.86 

0.71750 

.75 

117.88 

1 

0 

1.76875 

3.25 

197.63 

1.49625 

4.00 

150.43 

-1 

1 

1.28750 

6.50 

174.92 

1.68625 

28.25 

233.58 

0 

0 

0.95125 

12.75 

164.18 

1.37875 

6.50 

170.31 

0 

0 

1.27375 

10.25 

235.43 

1.68750 

6.25 

196.50 

0 

0 

0.88750 

14.00 

155.15 

1.29750 

13.75 

172.76 

0 

0 

1.19625 

14.75 

155.08 

0.68875 

9.75 

172.25 

The  overall  relative  improvement  in  tool  wear  was  calculated  to  be  1.5  times  for  the 
implanted  tools  versus  the  unimplanted  tools.  The  mean  wear  for  the  two  tool  types  at  the 
three  speeds  is  illustrated  below  in  Fig.  3. 


Mean  Wear 
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Figure  3.  Mean  Wear  at  the  3  Speed  Levels 
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The  experiment  and  mathematical  model  suggest  a  four-way  analysis  of  variance 
(ANOVA).  The  results  from  each  5-min.  cut  and  time  were  used  as  the  variables  of  interest 
versus  tool  wear  and  surface  finish  to  check  for  any  possible  interactions.  The  JMP 
software  package  by  SAS  was  used  to  produce  the  results.  Tool  wear  and  surface  finish 
were  dependent  variables  in  conjunction  with  independent  variables:  tool  type,  speed,  feed, 
and  time.  The  resulting  F  ratios  and  sum  of  squares  for  tool  wear  is  listed  in  Table  n.  The 
results  for  surface  finish  are  listed  in  Table  HI. 

Table:  II  ANOVA  results  for  Tool  Wear 

Degrees  of  Sum  of 
Freedom  Squares 

Source  of  Variation _ (dF> _ (SS) _ F  Ratio _ Prob>F 


Tool  Type 

1 

0.00037950 

35.4845 

0.0000 

Speed 

2 

0.00074845 

34.9914 

0.0000 

Tool  Type  *  Speed 

2 

0.00109753 

51.3111 

0.0000 

Feed 

2 

0.00223420 

104.4523 

0.0000 

Tool  Type  *  Feed 

2 

0.00051219 

23.9457 

0.0000 

Speed  *  Feed 

4 

0.00083297 

19.4713 

0.0000 

Tool  Type  *  Speed  *  Feed 

4 

0.00272607 

63.7242 

0.0000 

Time 

3 

0.00011800 

3.6777 

0.0261 

Tool  Type  *  Time 

3 

0.00002355 

0.7341 

0.5419 

Speed  *  Time 

6 

0.00004860 

0.7574 

0.6101 

Tool  Type  *  Speed  *  Time 

6 

0.00004657 

0.7257 

0.6332 

Feed  *  Time 

6 

0.00004666 

0.7272 

0.6321 

Tool  Type  *  Feed  *  Time 

6 

0.00002006 

0.3126 

0.9241 

Speed  *  Feed  *  Time 

12 

0.00003002 

0.2339 

0.9941 

Tool  Type  *  Speed  *  Feed 

*  Time  12 

0.00005570 

0.4340 

0.9331 

Root  Mean  Square  Error  = 

0.003270 

R  Square  =  0.969219 

Table  III:  ANOVA  results  for  Surface  Finish 

Degrees  of  Sum  of 

Freedom  Squares 

Source  of  Variation 

__  . fdF) 

CSS) 

F  Ratio 

Prob>F 

Tool  Type 

1 

105.06 

0.1031 

0.7509 

Speed 

2 

4573.23 

2.2449 

0.1277 

Tool  Type  *  Speed 

2 

27108.16 

13.3071 

0.0001 

Feed 

2 

113574.00 

55.7522 

0.0000 

Tool  Type  *  Feed 

2 

2063.73 

1.0131 

0.3781 

Speed  *  Feed 

4 

38211.99 

9.3789 

0.0001 

Tool  Type  *  Speed  *  Feed 

4 

38788.72 

9.5205 

0.0001 

Time 

3 

1718.74 

0.5625 

0.6450 

Tool  Type  *  Time 

3 

926.02 

0.3030 

0.8229 

Speed  *  Time 

6 

3536.28 

0.5786 

0.7436 

Tool  Type  *  Speed  *  Time 

6 

6421.75 

1.0508 

0.4183 

Feed  *  Time 

6 

1706.57 

0.2792 

0.9412 

Tool  Type  *  Feed  *  Time 

6 

3758.25 

0.6150 

0.7162 

Speed  *  Feed  *  Time 

12 

8402.03 

0.6874 

0.7473 

Tool  Type  *  Speed  *  Feed 

*  Time  12 

8099.34 

0.6626 

0.7688 

Root  Mean  Square  Error  = 

31.91491 

R  Square  =  0.908297 
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Testing  of  the  hypothesis  that  each  corresponding  source  of  variation  has  a  statistically 
significant  effect  on  tool  wear  and  surface  finish  was  done  by  calculating  the  corresponding 
F  statistic  and  associated  p- value  (Prob>F).  In  the  analysis  of  tool  wear  we  find  that  all  the 
single  sources  of  variation  (Tool  type,  Speed,  Feed,  and  Time)  are  significant  at  the  5% 
level  of  significance.  In  addition,  the  interactions  of  Tool  Type  *  Speed,  Tool  Type*  Feed , 
Speed  *Feed,  and  Tool  Type  *Speed  *Feed  are  also  significant. 

The  effects  of  speed  and  feed  on  tool  wear  and  surface  finish  have  been  well  documented 
and  studied.  1.19,22,23  These  interactions  could  be  expected  in  an  experiment  of  this  type. 
The  high  F  value  for  Tool  Type  signifies  a  statistical  difference  in  the  two  tool  types  on  tool 
wear,  but  the  small  F  value  for  surface  finish  shows  little  effect.  Implantation  does  have  a 
significant  effect  on  tool  wear  but  not  surface  finish. 

The  resulting  r2  of  0.979219  indicates  that  there  is  a  strong  linear  relationship  between  the 
variable  of  interest,  tool  wear,  and  the  resulting  model  tested  in  the  ANOVA.  The  surface 
finish  model  showed  a  strong  linear  relationship  as  well  with  a  r2  of  0.908297. 

V.  Discussion 

The  wear  rates  for  the  implanted  and  unimplanted  tools  are  similar  at  the  high  speeds  in  this 
experiment.  The  implanted  tools  performed  better  than  the  unimplanted  tools  at  the  middle- 
and  lower-speed  levels.  The  researchers  believe  that  these  similarities  can  be  expected 
because  of  die  nature  of  ion  implantation.  The  similarities  of  the  tools  at  the  highest  speed 
level  are  attributed  to  the  high  temperatures  that  are  present  in  the  tool  work  piece  interface. 
The  high  temperatures  precipitate  thermal  softening  at  the  tip  of  the  tool.  Thermal  softening 
is  the  common  mode  of  failure  in  high-speed  steel  tools.23  The  implantation  effects  have 
been  known  to  dissipate  quickly  at  high  temperatures. 

Ion  implantation’s  effect  on  surface  hardness  and  wear  resistance  has  been  well  studied  in 
the  laboratory  environment.  This  study  shows  that  ion  implantation  does  have  a  significant 
effect  on  improving  wear  resistance  of  high-speed  steel  tools  when  implanted  with 
nitrogen.  Ion  implantation  does  not  have  a  sizable  effect  on  performance  as  measured  by 
the  resulting  surface  finish  of  the  work  piece. 

Further  research  needs  to  be  done  on  effects  that  implantation  has  on  various  cutting  tools. 
Many  avenues  still  need  to  be  explored.  Because,  implantation  does  effect  the  tool’s  wear 
resistance,  the  new  tool  needs  to  have  recommended  speeds  and  feeds  developed  for  it  to 
fully  utilize  its  increased  performance.  These  studies  could  be  expanded  to  include 
implanting  other  tool  types  (carbides,  coated  carbides,  TiN-coated  tools)  and  their 
corresponding  wear  improvements  to  develop  corresponding  tool  life  equations  and 
recommended  processing  parameters. 

The  basic  research  to  this  point  involved  understanding  the  effects  of  changing  voltages, 
fluences,  dopants,  and  other  implantation  variables  to  find  an  optimum  hardness 
improvement  in  the  material.  This  research  should  be  expanded  to  include  tooling  to 
determine  the  optimum  implantation  parameters.  This  study  used  nitrogen  at  45  keV 
because  that  amount  had  been  determined  to  be  the  optimum  level  to  increase  hardness  in 
tool  steel.  The  increase  in  hardness  may  not  result  an  increase  in  wear  resistance  and  should 
be  studied  further. 
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ABSTRACT: 

This  paper  presents  results  from  an  Army  Research  Office  grant 
developing  knowledge  based  systems  for  Computer  Numerical  Control 
(CNC)  machining  operations.  One  of  the  areas  of  development 
involves  knowledge/data  acquisition  for  input  parameters  to  the 
knowledge  processor.  In  our  work  we  chose  artificial  neural  networks 
as  a  mechanism  for  capturing  data  and  then  utilized  the  ability  of  the 
networks  to  determine  certain  process  parameters  for  automating 
specific  tasks  in  CNC  programming.  Artificial  neural  networks  are  well 
adapted  to  this  task  because  they  can  be  continuously  improved. 

INTRODUCTION 

An  increasing  need  for  the  rapid  production  of  small  batches  and  prototype  parts  will 
drive  manufacturers  to  seek  new  ways  to  decrease  lead  times  and  costs  associated  with 
Computer  Numerical  Control  (CNC)  programming  and  production  planning.  Planning 
and  developing  CNC  programs  for  the  production  of  mechanical  parts  can  be  an 
expensive  and  time-consuming  process.  A  process  engineer  (domain  expert),  who 
traditionally  performs  these  functions  routinely  applies  learned  skills  to  prescribe 
manufacturing  operation  steps  and  specific  machining  parameters.  One  author 
summarizes  the  problem  as  follows: 

This  knowledge  is  better  described  as  many  differently  weighted  pieces 
of  advice  than  as  a  small  set  of  constraints  to  be  satisfied.  Such 
planning  involves  taking  into  account  a  great  variety  of  both 
technological  rules  and  economic  considerations.  Furthermore,  they 
are  not  absolute.  They  are  more  preferences,  between  which 
compromises  may  be  necessary.  In  addition,  they  may  differ  somewhat 
from  one  company  to  another.  In  fact,  they  represent  the  experience 
and  know-how  of  engineers.  An  immediate  consequence  is  that  the 
planning  of  machining  sequences  definitely  does  not  have  a  unique 
solution.  (Descotte  and  Latombe,  1989) 

Problem  Identification 

Commercial  CAD/CAM/CAPP  software  do  not  currently  facilitate  the  automatic 
accumulation  of  design  and  manufacturing  data  produced  throughout  the  planning  and 
production  of  parts.  Today's  CNC  mfcTSie  tool  programming  methods  are  often 


inefficient  and  do  not  utilize  the  product  model  concept.  Plans  are  often  produced  and 
executed  on  the  fly,  modified  on  only  the  final  hard  copy,  and  inaccessible  by  people 
responsible  for  job  costing.  Furthermore,  the  intelligence  and  know-how  to  manufacture 
and  track  parts  resides  in  a  wide  base  of  skilled  crafts-people,  process  engineers  and 
manufacturing  experts. 

Commercial  CAD/CAM  gets  limited  acceptance  among  CNC  machine  operators 
and  CNC  programmers  who  perceive  that  their  jobs  are  being  replaced  by  computers. 
And  of  course  the  systems  available  for  efficiently  capturing  the  knowledge  of  the  'skilled' 
are  for  the  most  part  still  in  a  state  of  development. 

Artificial  Neural  Networks 

There  has  been  an  increase  of  interest  in  brain  style  computing'  in  terms  of  artificial  neural 
networks  (ANNs).  ANNs  are  parallel  distributed  processing  architectures  composed  of  a 
huge  number  of  small  interacting  elements  that  are  massively  interconnected.  Each  of 
these  elements  sends  excitatory  and  inhibitory  signals  to  other  units  that  in  turn  update 
their  behaviors  based  on  these  received  messages.  ANNs  emulate  the  functionality  of  the 
human  brain  that  implicitly  stores  knowledge  in  the  interconnection  weights,  and  not  in 
the  neurons  themselves  (Rumelhart  and  McClelland,  1986).  Learning  is  thus  achieved  by 
modifying  the  connection  weights  between  elements.  Feedforward  neural  nets  acquire 
knowledge  through  training,  where  by  repeatedly  presenting  sample  cases  to  the  net,  its 
interconnection  weights  change  accordingly  to  model  the  representation  of  the  cases 
(Shalkof,  1992).  Knowledge  is  stored  in  the  final  interconnections  values.  As  such  neural 
network  methodology  would  ease  the  knowledge  acquisition  bottleneck  that  is  hampering 
the  creation  of  expert  system  (Zurada,  1992).  The  key  to  a  neural  network's  power  is  the 
inherent  parallelism  resulting  in  fast  computation,  robustness  or  error  resistance,  and 
adaptiveness  or  generalization.  A  typical  network  consists  of  a  set  of  processing  elements 
(PE)  grouped  hierarchically  in  layers  and  interconnected  in  some  fashion.  These  PEs  sum 
N  weighted  inputs  and  transfer  that  outcome  through  a  mathematical  function.  One  of  the 
most  popular  neural  paradigms,  and  the  one  that  has  been  used  in  this  project,  is  the 
multilayer  perceptron  trained  using  the  Error  Back  Propagation  Training  Algorithm 
(EBPTA)  (Dayhoff,  1990).  Within  the  manufacturing  environment  ANNs  have  been 
applied  to  quality  control,  process  control,  part-family  classification  (Martinez,  Smith,  and 
Bidanda,  1993;  Telasco,  1993;  Udo,  1992;  Cook  and  Shannon,  1992;  Charraprty  and 
Uptal,  1993)  Neural  network  models  have  been  also  used  for  optimizing  cutting 
parameters  of  turning  operations  (Wang,  1993). 

Neural  network  models  have  key  advantages  over  rule-based  systems.  First,  the 
learning  process,  or  the  knowledge  acquisition  process,  takes  place  by  presenting  only 
sample  (machining)  cases  to  the  network.  Learning  occurs  through  modifying  the 
connection  weights  between  network  elements  (by  a  learning  algorithm)  to  model  the 
representation  of  the  cases  (Shalkof,  1992).  Knowledge  is  thus  stored  implicitly  in  the 
final  interconnection  values.  Second,  because  training  examples  need  not  be  exhaustive,  it 
is  conceivable  that  neural  network  methodology  could  ease  the  knowledge  acquisition 
bottleneck  that  is  hampering  the  creation  of  rule-based  systems  (Zurada,  1992).  Finally, 
when  utilized  as  a  decision  processor,  an  artificial  neural  network  has  inherent  parallelism 
resulting  in  fast  computation,  robustness  or  error  resistance,  and  adaptiveness  or 
generalization. 
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Computer  Aided  Process  Planning 

Computer  Aided  Process  and  operations  Planning  (CAPP)  systems  attempt  to  automate 
the  decision  making  process  necessaiy  to  map  design  specifications  on  the  company 
specific  manufacturing  environment.  Many  CAPP  systems  utilize  some  form  of  Artificial 
Intelligence  (AI)  to  model  the  human  interaction  previously  required  by  the  task. 
CAD/CAM  software  and  databases  are  used  to  supply  product  descriptions  and 
information  management,  respectively.  Material  removal  operations  provide  a  rich 
environment  for  the  implementation  of  intelligent  applications.  Much  of  the  current  work 
in  this  field  concentrates  on  the  AI  technique  used  to  process  the  knowledge  and/or  the 
solid  model  design  features  utilized  by  the  engineer  to  initially  create  the  part  (Anderson 
and  Chang,  1990;  Chung,  Patel,  and  Cook,  1990;  Joshi,  Vissa,  and  Chang,  1989;  Phillips 
and  Mouleeswaran,  1985). 

Manufacturing  Assistants 

Manufacturing  Assistants  are  systems  which  emphasize  the  specific  decisions  and 
associated  data  which  must  be  considered  when  selecting  tools,  cutting  speeds,  tool 
motions,  etc.  in  a  manufacturing  or  machining  environment.  As  discussed  here,  the  term 
machining  is  defined  as  the  removal  of  unwanted  material  from  a  workpiece  in  chip  form 
so  as  to  obtain  a  completed  product  that  meets  size,  shape  and  finish  characteristics 
specified  by  the  designer.  The  following  factors  contribute  to  the  need  for  improved 
methods  to  manage  the  manufacture  of  machined  parts:  advanced  machine  tool  and 
computer  technology,  higher  quality  and  stricter  tolerance  and  finish  requirements,  new 
materials  such  as  plastics,  composites,  and  exotic  alloys,  smaller  batch  sizes  and  prototype 
parts,  more  complex  part  descriptions,  and  a  more  competitive  marketplace. 

The  Turning  Assistant 

The  Turning  Assistant  (TA)  is  a  direct  outcome  of  an  ARO  project  which  automatically 
creates  plans  for  Computer  Numerical  Control  (CNC)  lathe  operations.  Starting  with  a 
drawing  and  a  set  of  predefined  machines  and  cutting  tools,  the  TA  interactively  defines 
and  produces  code  for  a  variety  of  features  such  as  profiles,  grooves,  and  threads.  The 
TA  focuses  on  the  formalization  of  domain  knowledge,  providing  document  support,  and 
reducing  the  costs  and  lead  times  associated  with  the  planning  function.  For  advanced 
systems  to  be  usable  in  shops  which  currently  dominate  small  batch  production  of 
machined  parts,  flexible  tools  for  fitting  the  application  to  the  environment  must  be  made 
available. 

The  Turning  Assistant  is  a  system  that  is  based  on  previous  work  providing 
automated  programming  for  a  variety  of  milled  features.  The  "Milling  Assistant"  (MA) 
(Burd,  1989)  evolved  from  an  Expert  System  based  decision  processor  to  a  Case-Based 
Reasoning  (CBR)  system  to  automate  the  decision  process  required  for  machining  milled 
and  point-to-point  features  on  CNC  machine  tools. 

PRODUCT  MODELS 

The  complete  electronic  product  definition  for  a  manufactured  part  consists  of  all  the 
information  related  to  designing,  producing,  inspecting,  and  managing  the  manufacture  of 
that  part.  The  data  provided  by  the  product  model  must  be  accessible,  well  organized, 
and  conform  to  standards  prevalent  in  b4otfi  Vdustry  and  government.  The  Computer 


Aided  Acquisition  and  Logistics  Support  (CALS)  and  Product  Definition  Exchange 
Specification  (PDES)  /  Standard  for  the  Exchange  of  Product  Data  (STEP)  initiatives  are 
currently  addressing  these  very  issues. 

Knowledge  processor 

Numerous  systems  can  be  utilized  for  decision  processing.  The  TA  decision  processor  is 
written  in  GRAPL-IV,  a  FORTRAN  77  based  application  interface  within  the  ANVIL- 
5000  CAD/CAM  system  and  is  used  to  determine  machining  strategies  and  effectively 
execute  the  operation.  Another  is  based  on  a  series  of  ANN’S  written  in  C,  which  can  be 
called  as  needed  from  the  application  interface  within  the  ANVIL-5000  CAD/CAM 
system.  Another  set  of  C  programs  can  make  calls  to  MetCAPP-HI  or  MetCAPP-IV  to 
acquire  necessary  machining  parameters  and  cutting  strategies. 

KNOWLEDGE  ACQUISITION  AND  REPRESENTATION 

The  most  important  and  difficult  aspect  of  this  research  is  to  develop  effective  tools  with 
which  to  elicit  knowledge  from  domain  experts.  The  skills  obtained  through  years  of 
observation,  training,  improvising,  and  trial-and-error  are  not  easy  to  explain  verbally, 
document,  or  formalize  for  the  purpose  of  building  automated  CNC  programming 
systems.  A  significant  amount  of  time  must  be  spent  in  familiarizing  oneself  with  the  field 
and  terminology  before  attempting  to  solicit  strategies  directly  from  the  CNC 
programmers  and  machinists. 

ANN’S  have  a  distinct  advantage  in  speed,  development  and  knowledge  updating 
(hence  learning).  Automatic  creation  of  CNC  programs  requires:  1)  user-interaction  --  to 
gather  location  specific  operating  information,  2)  interrogation  of  the  features  geometric 
description  within  the  CAD  database,  3)  a  file  based  communication  link  to  a  decision 
processor  and,  4)  An  applications  interface  language  capable  of  building  tool  paths  from 
user  generated  programs.  Very  few  commercial  CAD/CAM  systems  provide  the  user 
with  enough  power  to  accomplish  these  tasks. 

Representing  the  Knowledge 

This  research  helped  to  develop  knowledge  elicitation  and  representation  tools  that  can  be 
used  to  formalize  the  expertise  gained  through  machining  strategy  meetings,  literature 
review,  and  programmer  feedback.  It  includes  the  design  of  questionnaires  that  can  be 
used  to  document  strategies  in  a  set  format  during  interviews  or  literature  review  and  data 
collection  forms  for  training  ANNs  (see  Table  1). 


TABLE  1:  ANN  TOOL  TYPE  DATA  COLLECTION  FORM 


Input  Decision  Variables 

Output  Decision  Variables 

* 

Variable  Name 

Value 

Weight  Factor 

The  correct  tool  material: 
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Part  (stock)  material 

El 

m 

B 

B 

B 

10 

1 .  High  Speed  Steel 

b 

Part  material  bH 

El 

H 

D 

B 

B 

10 

2.  Carbide 

B 

iMHRnn 

El 

ES 

B 

B 

10 

3.  Ceramic 

a 

Feature  Depth 

B 

B 

D 

B 

B 

10 

Comments: 

Artificial  Neural  Networks  as  Knowledge  Acquisition  Tools 

The  adopted  research  methodology  aimed  at  investigating  the  utilization  of  a  neural-based 
knowledge  acquisition  approach.  It  is  based  on  the  operational  mode  of  neural  systems, 
and  taking  advantage  of  our  knowledge  and  experience  in  developing  rule-based  systems 
for  similar  tasks  (i.e.  milling  assistant  (Burd,  1990)).  This  methodology  satisfies  a  number 
of  basic  requirements.  First,  it  must  reduce  the  burden  associated  with  developing  rule- 
based  systems  by  avoiding  constructing  general  rules,  because,  as  explained  earlier, 
experts  are  not  good  at  generalizing  solutions.  Second,  this  methodology  must  ensure 
that  knowledge  be  cast  in  a  form  usable  by  neural  systems;  forms  of  rules  are  excluded. 
What  is  required  by  a  neural  system  is  a  set  of  training  examples,  where  each  example 
represents  a  real  machining  case  described  by  a  vector  of  numeric  values  of  process 
variables  encoded  in  a  convenient  from.  Third,  consequently,  the  new  methodology  must 
employ  effective  instrumental  knowledge  gathering  forms;  and  since  no  general  inferential 
statements  are  required  at  this  stage,  the  knowledge  gathering  forms  can  be  mere  data 
collection  forms 

A  neural  network  paradigm  must  first  be  selected  in  consensus  with  the  application 
at  hand.  In  this  research,  a  supervised  neural  network  model  is  employed  as  a  classifier. 
A  supervised  learning  model  is  selected  as  it  enables  the  expert  to  express  his  judgmental 
opinion  in  a  relative  manner.  This  feature  is  very  essential  because,  in  this  application,  it  is 
not  required  to  group  the  training  examples  into  clusters  as  would  be  the  outcome  if  an 
unsupervised  learning  model  were  used.  Instead,  every  single  training  example  must  be 
explicitly  identified  (by  the  expert)  as  belonging  to  a  specific  class  that  represents  a 
machining  decision.  The  neural  model  is  employed  as  a  classifier  so  that  the  output 
decision  belongs  to  a  class  of  objects  (e.g.  type  of  tool  material:  HSS,  Carbide,  Ceramic), 
or  to  a  range  of  numeric  values  (e.g.  tool  size:  l/4"-l/2",  etc.).  This  method 

shows  great  promise  considering  the  variety  of  machining  choices  that  can  satisfy  the 
requirements  of  a  quality  product.  See  Table  2  for  a  representative  example  of  the  ANN 
training  and  testing  results. 
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TABLE  2:  TOOL  MATERIAL  NETWORK  CLASSIFICATION 

PERFORMANCE 
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EXECUTIVE  SUMMARY 

Accurate  real  time  displacement  measurements  are  essential  to  increasing  the  productivity 
and  decreasing  the  cost  of  ultra-high  precision  machining.  This  research  presents  a  first 
step  in  that  direction.  We  have  demonstrated  a  simple  measurement  system  that  can 
operate  with  the  machine  cutting  and  maintain  an  measurement  accuracy  of  0.2  micro¬ 
inches.  We  used  the  sinusoidal  phase  measurement  technique  using  laser  diodes,  SPM-LD, 
and  have  invented  a  new  algorithm  for  data  analysis  that  allows  for  the  first  time  the 
extension  of  this  method  to  dynamic  ranges  on  the  order  of  ten  inches  with  high  accuracy. 
We  have  demonstrated  a  dynamic  range  that  is  a  factor  of  1,000  greater  than  the  previous 
research.  However,  our  demonstrated  dynamic  range  was  limited  by  our  hardware  to  0.04 
inches,  better  hardware  will  easily  increase  the  dynamic  range  to  several  inches  at  high 
accuracy.  In  general,  the  dynamic  range  can  be  increased  if  we  relax  the  accuracy 
requirements.  Finally,  an  invention  disclosure  is  being  filed  on  this  measurement  technique. 
The  original  goal  of  this  effort  was  to  demonstrate  a  system  that  had  a  dynamic  range  of  lA 
inch  and  a  resolution  of  20  micro-inches  in  a  vibrating  environment,  as  you  can  see  we 
have  clearly  exceeded  the  original  goals.  The  next  step  in  this  research  would  be  to  build  a 
prototype  system  to  place  on  a  diamond  turning  machine. 

This  work  was  the  MS  thesis  of  Mr.  Chi  Man  and  he  was  supported  by  this  grant.  Also 
supported  in  this  grant  were  Mr.  Lawrence  Alvarez,  a  staff  engineer  at  NMSU,  Prof.  Yi 
Fang  Wu,  a  visiting  professor  from  Bejing  University,  Dr.  Bing  Yin  and  Professor  T.  M. 
Shay  of  New  Mexico  State  University. 
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1. 


Introduction 


Our  purpose  is  to  develop  a  real-time  measurement  displacement  system  with  mm 
order  dynamic  range  and  5  nm  resolution.  Thus,  we  are  able  to  obtain  a  wide  dynamic 
range  and  high  resolution  in  a  simple  interferometer  that  can  tolerate  vibration  of  up  to 
200  nm.  This  system  can  provide  the  crucial  part  of  a  closed  loop  for  precision  machine 
tools. 


As  is  well  known,  interferometers  are  theoretically  capable  of  wide  dynamic  range 
and  high  resolution.  The  resolution  is  theoretically  limited  only  by  the  quantum  noise 
(shot  noise)  from  a  photodetector.  It  is  difficult  to  implement  both  wide  dynamic  range 
and  high  resolution  simultaneously  in  a  single  experimental  system,  because  high 
resolution  depends  upon  how  a  small  fraction  of  the  wavelength  can  be  resolved  and 
dynamic  range  depends  on  how  many  fringes  are  counted  reliably. 

Several  techniques  have  been  used  to  overcome  these  drawbacks:  two  wavelength 
interferometry[2],  frequency  modulated  continuous  wave  LD  interferometryt3]  ect.  In  Ref. 

3  the  wavelength  is  linearly  ramped  and  that  system  demonstrated  a  range  of  between  5  - 
600  mm  with  a  resolution  of  50  pm,  the  resolution  is  too  low  for  precision  machining 
applications.  In  References  4  and  5,  the  Sinusoidal  Phase  Modulation  (SPM)  method  is 
used,  there  the  wavelength  is  modulated  with  a  sinusoidal  wave  and  the  phase  of  the 
interfering  signal  depends  upon  the  path  difference  of  the  interferometer  legs.  The  phase  of 
interference  signal  is  calculated  using  a  synchronous  detection  to  obtain  the  amplitude  of 
the  first  and  second  harmonic  components  of  the  SPM  frequency.  They  demonstrated  a 
dynamic  range  of  X  and  measure  a  resolution  of  X/800  and  show  that  a  range  of  23  •  X 
and  a  resolution  of  X/255  should  be  possible.  The  range  of  there  system  was  limited  by  the 
requirement  that  the  amplitudes  of  the  first  and  second  harmonic  detection  loops  be 
appropriately  balanced.  The  required  ratio  of  the  gain  for  the  first  and  second  harmonic 
loops  changes  as  the  displacement  changes  and  hence  using  their  algorithm  the  dynamic 
range  must  be  kept  short  or  the  accuracy  falls  off  rapidly.  Furthermore,  as  the  gain  of  the 
circuit  is  changed  the  relative  phase  shift  between  the  2  circuits  will  change,  since  we  are 
measuring  a  phase  change  this  introduces  a  systemic  error. 

In  this  work,  we  developed  the  theory  of  Wide  Dynamic  Range  Sinusoidal  Phase 
Modulation  Laser  Diode  (WDR-SPM-LD)  interferometry  which  uses  a  slowly-varying 
approximation  and  we  present  a  new  algorithm  for  calculating  the  phase  of  the  SPM 
signal.  The  advantages  of  this  new  algorithm  are:  first,  it  is  significantly  less  sensitive  to 
changes  in  the  amplitudes  of  the  first  or  second  harmonic  and  hereby  increases  the 
dynamic  range  by  3  orders  of  magnitude;  second,  the  simplicity  of  the  phase  demodulating 
circuit  is  preserved;  finally,  it  can  be  implemented  on  a  computer  for  a  real-time  automatic 
measurement  displacement.  Thus,  our  system  can  be  implemented  in  a  closed  loop  wide 
dynamic  range  system  for  precision  machining.  Our  system  consists  of  two  unbalanced 
Michelson  interferometers,  a  harmonic  phase  demodulator,  an  active  feedback  control 
system  that  compensates  for  thermal  drifts,  as  yell  as,  reduces  the  effects  of  external 
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environmental  disturbances,  12-bit  A/D  conversion  and  automatic  real-time  measurement 
displacement  in  Lab  VIEW  software  by  a  computer.  We  used  both  a  PZT  and  a  Picomotor 
as  the  displacement  actuators  for  measuring  the  resolution  or  a  dynamic  range. 

2n 

Experimental  results  show  that  1  mm  range  and  phase  resolution  are  achieved.  The 


phase  resolution  corresponds  to  a  0.2  nm  spatial  resolution.  The  dynamic  range  has  been 
increased  by  over  3  orders  of  magnitude  compared  to  the  previous  state  of  the  art[5].  The 
absolute  accuracy  of  the  measurements  depends  upon  the  reproducibility  of  the  actuator 
itself.  The  New  Focus  Picomotor  was  used  for  the  wide  dynamic  range  measurements  and 
as  stated  by  New  Focus  the  positioning  precision  is  0. 1  %,  this  was  calibrated  against  a 
HeindenHain  gauge  and  compared  to  the  measurements  of  our  system.  The  accuracy  of 
our  measurements  agreed  with  the  measurements  by  the  Heindenhain  gauge.  Using  WDR- 
SPM-LD  we  were  able  to  calibrate  our  Picomotor. 


2.  Theory 

2.1  Laser  Diode  Sinusoidal  phase  modulating  (SPM)  interferometer 


Figure  la  shows  the  setup  of  SPM-LD  interferometer  for  real-time  displacement 
measurement.  The  optical  configuration  is  an  unbalanced  Michelson  interferometer.  The 
light  emitted  from  the  laser  diode  (LD)  passes  through  an  optical  isolator  (01)  to  prevent 
optical  feedback  and  is  split  into  a  reference  beam  and  an  object  beam  with  a  beam  splitter 
(BS).  The  optical  path  difference  between  the4wo  arms  of  the  interferometer  is  i  =  2x . 
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The  injection  current  of  the  LD  consists  of  a  dc  bias  current  i0  and  modulation  current 
/m(t),  which  are  supplied  by  the  laser  diode  controller  (LC).  Current  z0  determines  the 
central  wavelength  the  \  of  the  LD.  The  modulation  current  im( t)  is 

im(t)  =  a  cos  (coj)  (1) 

Because  the  modulation  current  changes  the  wavelength  by  fiijj) ,  sinusoidal  phase 

modulation  is  achieved,  where  |3  is  the  modulation  efficiency.  The  interference  signal  is 
detected  by  a  photodetector,  PD.  The  time- varying  component,  or  ac  component,  of 
interference  is  given  by 

P(f)  =  P0  cos  [z  cos  COj  +  0\  (2) 


/  4  n 

where  z  =  Anapx  ,  0  =  and  X  is  the  distance  difference  between  the  two 
arms  of  the  interferometer,  Po  is  the  amplitude  of  the  ac  component. 


2.2  The  Harmonic  Phase  Demodulator 


Fig.  lb  Shows  a  block  diagram  of  the  harmonic  demodulation  system. 


In  equation  (2),  the  unknown  displacement  x  is  obtained  from  the  phase  0  ,  but  0 
has  an  ambiguity  of  2n.  The  key  task  is  accurately  and  unambiguously  to  extract  the 
interferometric  phase.  For  this  purpose  the  demodulator  is  required  to  have  a  high 
resolution  and  a  wide  dynamic  range.  In  this  article,  we  use  multi-harmonic  phase 
demodulation,  that  is,  the  interference  signal  P(t)  is  synchronously  demodulated  by 
multiplying  it  times  zm(t)  and  integrating  to  obtain  the  amplitude  of  the  quadrature 
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components,  or  sin  0  and  cos  0.  The  block  diagram  is  shown  in  Fig.  lb.  Expanding  Eq. 
(2),  we  obtain 

P(?)  =  P0  cos  0[JO  (z)  -  2h  (z)  cos  2  eomt  +. . .  ] 

-P0  sin  (^21  x{z)  cos  comt-2J3(z )  cos  7>comt  +  ...] 

where  */„(z)  is  an  n’th-order  Bessel  function.  Multiplying  P(t)  by  im(t)  and  passing  it 
through  the  low-pass  filter  (LPF),  we  obtain  the  signal  associated  with  the  first  harmonic 
frequency  component  as  follows: 

=  Ax  sin  0  (4) 


where 


A  =  -2Ky  P0aJ\{z)  (5) 

where  is  the  LPFi  gain.  Squaring  im(t)  and  suppressing  the  dc  component,  we  obtain 
the  signal 


COS 


(2  aj) 


(6) 


Multiplying  P(t)  by  im2(t)  and  passing  it  through  the  low-pass  filter  (LPF2),  the  desired 
signal  associated  with  the  second  frequency  component  can  be  obtained  as  follows: 

Y2  =  A2  cos  0  (7) 

where 

A2=-K2P0a2J2(z )  (8) 

where  K2  is  the  LPF2  gain.  The  signal  processing  to  obtain  the  signal  of  Eq.  (4)  and  (7) 
are  called  harmonic  demodulator.  Combined  with  Eq.  (4)  and  (7),  the  phase  0  can  be 
calculated  from 


0  =  tan'1 


(9) 


Thus,  once  0  and  X  are  known  then  tjig^lisplacement  can  be  calculated  using, 
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2.3  Slowly-varying  Approximation 

If,  as  in  Reference  [5],  Ki  and  K2  were  set  to  satisfy  the  condition,  AX=A2,  then 
Eq.  (9)  becomes 


6  =  tan  1 


(£) 

UJ 


(10) 


The  authors  of  Ref.  [5]  used  Eq.  (10)  to  calculate  the  interference  phase ,  then 
displacement.  However,  due  to  the  limited  gain  range  of  LPF,  especially,  the  unbalance  of 
the  electronic  circuits  and  imperfection  of  the  electrical  components,  the  additional  errors 
due  to  phase  shifts  of  the  electronic  circuits  occur  when  the  gain  of  LPF  is  adjusted.  Thus, 


it  is  difficult  to  increase  the  dynamic  range  beyond 


Now,  we  present  a  new  principle  that  provides  a  much  wider  dynamic  range 
displacement  measurement  and  avoids  any  adjustments  in  the  circuit  gain. 

In  contrast  to  the  assumptions  required  inorder  to  use  Eq.  10  the  amplitudes  Ax  and  A2 
change  as  a  function  of  %  and  thus  Eq.  10  is  not  valid  for  large  dynamic  ranges.  We 
develop  the  slowly-varying  approximation  theory  that  automatically  compensates  for 
changes  of  the  amplitudes  Ax  and  A2.  The  theory  is  summarized  below. 


Since  the  change  of  the  amplitudes  Ax  and  A2  vary  extremely  slowly  over  distances  on  the 
order  of  A.  and  a  single  step  of  the  positioning  device  is  limited  to  less  than  A/2.  Thus  we 


can  use  can  use  the  slowly-varying  approximation,  that  is. 


dx 


0  and 


dA,{x) 

dx 


«  0 . 


Differentiating  Eqs.  (4)  and  (5).  We  can  obtain 

(11) 
(12) 

Combining  Eqs.  (4),  (7)  and  (1 1),  (12),  we  get 


— 1  =  A,(2K)  cos(2Kx) 
dx 

=  A2(-  2 K)  sin(2Kx) 
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tan2  6 


Y]  \  dc 
YjdY, 


the  phase  0  is  calculated  from 


6  =  tan-1  (- 1) 


2l  \dx 

Y2‘ (dr, 


dY  dY. 1 

Considering  the  signs  of  Yl9  Y2,  — and  — ^  in  four  quadrants,  it  is  not  difficult  to 

understand  the  meaning  of  the  minus  of  square  root  in  the  right  of  Eq.  (14).  From 
Eq.(14),we  developed  a  novel  method  to  calculate  an  interference  phase,  in  which  an 
affecting  to  Yi  (or  Y2)  due  to  the  change  of  amplitudes  Ai  (or  A2)  itself  has  been 

considered  and  determined  by  using  —  and  in  SPM  LD  interferometry. 

dx  dx 


3.  The  Feedback  Control  System  to  Reduce  the  External  Disturbances 

Wavelength  modulating  is  easy  to  implement  with  a  laser  diode  since  its  output 
wavelength  varies  linearly  with  the  injection  current  over  most  of  its  operating  range. 
However,  the  wavelength  fluctuation  AXW  due  to  changes  in  the  operating  conditions  has 
to  be  overcome  if  high  precision  measurements  are  to  be  made. 

In  addition,  optical  devices  in  an  interferometer  vibrate  in  response  to  external 
mechanical,  acoustic  vibrations.  Moreover,  the  interferometer  components  undergo 
thermal  expansion  due  to  fluctuations  in  temperature.  All  of  these  effects  cause  the 
changes  A£  in  the  optical  path  difference  £  between  the  object  and  reference  waves. 
These  AXW  and  A£  cause  the  fluctuation  in  the  phase  of  the  interference  signal.  The 
fluctuation  can  be  reduced  by  controlling  the  injection  current  to  produce  the  change  AXf 
in  the  wavelength  of  LD,  as  reported  in  Ref.  [4], 
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Fig.  2  Block  diagram  of  the  experimental  system  for  SPM-LD  interferometry. 

As  shown  in  Fig.  2,  we  configured  a  pair  of  two-beam  interferometers  sharing  a 
common  LD,  where  one  (SYS1)  provides  an  interferometric  signal  generated  by  the  beam 
from  the  target,  and  the  other  (SYS2)  serves  as  a  feedback  signal  generator.  The  beam 
from  LD  was  split  into  two  beams  by  beam  splitter  BS1,  and  introduced  into  second 
independent  Michelson  interferometers.  The  reference  interferometer  has  two  fixed  arms 
with  a  known  optical  path  difference  2dr,  while  the  object  interferometer  has  a  fixed  arm 
and  a  variable  arm  with  a  movable  mirror.  Let  us  explain  how  the  feedback  signal  is 
generated  in  the  reference  interferometer. 

The  interference  signal  P2{t)  in  the  reference  interferometer 

P2{i)  =  PQ2cos[Zrcos((omt)  +  0r  +4(0]  O5) 


and 


Amp 

Zr  =  -  dr 

Ao 

(16) 

"3 

il 

(17) 

Aq  Aq 

(18) 
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The  feedback  signal  if{t)  is  produced  by  the  harmonic  demodulation  produce  in  SYS2. 

As  shown  in  Fig.  2,  Yf(t)  is  fed  to  the  amplifier,  whose  gain  in  Kp,  and,  we  obtain  the 
feedback  signal: 

J,(Zr)sm(er+Sr(t))  (19) 

This  feedback  signal  is  available  in  the  region  where  Zr  is  between  0.5  and  3.5. 
When  the  0r  is  close  to  a  multiple  of  7t  rad,  we  can  stabilize  the  phase  Sr(t)  at  zero  with  a 
proportional  feedback  control  using  the  feedback  signal  given  by  Eq.  (19).  This  feedback 
signal  if{t)  =  Af  sin£r(t)  is  fed  to  injection  current  of  the  LD.  Consequently,  the  phase 

change  introduced  by  the  external  varied  disturbance  is  compensated  by  the  change  in 
wavelength  AXf  =  pi  fit) . 

With  the  feedback  added  to  the  laser  diode  injection  current,  then  the  term  Sr(t)  in 
Eq.  (15)  is  actively  minimized  to  AS  as  follows: 

S,(t)  =  —Ad,---i-d,(AXw+AXr)  =  &S  (20) 

Using  the  same  method,  we  obtain  the  phase  fluctuation  <5i(r)  in  SYS1  as  follow: 


S1(/)  =  ^~Ad] 
A0 


4n 


d\  ( A/i id  +  A  Xf  ^ 


(21) 


By  substituting  Eq.  (20)  into  Eq.  (21),  5i(t)  can  be  expressed  as [4] 


S({t)  = 


Ait 


to  V 


M,  -j-Ad, 


+~AS 


(22) 


Note  the  same  light  source  LD  is  used  for  both  SYS1  and  SYS2,  and  assuming  that  both 
SYS1  and  SYS2  are  subject  to  the  same  disturbances,  then,  the  relationship 


Ad,  Adr 
d,  ~  d. 


(23) 


will  be  satisfied,  thus,  we  get  the  phase  £,(?) : 

S,(l)  =  j-AS 
dr 


(24) 
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Since  the  optical  path  difference  2di  is  not  equal  the  optical  path  difference  2dr,  the  phase 
fluctuation  <5j(?)  can  be  reduced  less  than  A5  if  selecting  dx  <  d,.. 


4.  Design  of  Real-time  Data  Processing  by  Computer 

Our  ultimate  goal  is  to  extract  in  real-time  distance  and  hence  the  phase  0  as 
precisely  as  possible,  over  as  wide  range  as  possible.  To  obtain  0  in  real-time,  the  required 
harmonic  signals  from  the  analog  harmonic  demodulator  are  measured  by  a  data 
acquisition  board  in  a  computer. 

The  signals  Yx  and  Y2  are  converted  to  12-bit  digital  value  by  a  National 
Instruments  model  AT-MIO-16E-2  data  acquisition  board.  The  AT-MIO-16E-2  board  is 
a  high-performance  multifunction  analog,  digital,  and  timing  VO  board  for  the  EBM-PC 
computers.  The  sampling  clock  for  A/D  converter  is  synchronized  with  the  modulation 
current  iJJ)  by  AT-MIO-16E-2  board  using  zero-cross  voltage  or  a  selected  fixed 

co 

voltage  values,  so  the  sampling  frequency  is  equal  to  the  modulation  frequency  — * — .  In 

2  71 

our  design  the  actual  input  range  and  the  least  significant  bit  (LSB)  of  this  acquisition 
board  are  selected  to  be,  ±  2.51^  and  1.22  mV,  respectively.  The  minimum  detectable 

In 

phase  (MDP)  can  be  determined  primarily  by  one  coded-phase  angle  —^yad .  Since  the 

phase  is  modulo  2x,  we  must  increment  or  decrement  the  phase  by  2n  each  time  the  phase 
crosses  the  positive  X  axis.  A  Lab  VIEW  program  was  used  to  do  this.  Once  data  is 
acquired,  a  Lab  VIEW  program  processes  the  data  according  to  the  theory  developed  in 
Section  2  and  the  displacement  is  displayed  by  the  computer.  In  an  actual  closed  loop 
system  the  feedback  logic  would  control  the  cutting  tool  motion. 

5.  The  Displacement  Actuators 

The  mirror  M2  was  mounted  onto  a  voltage  controlled  actuator.  Because  no  single 
device  could  give  us  high  resolution  and  wide  dynamic  range  we  have  used  two  distinct 
actuators.  First  for  high  resolution  displacement  measurements  the  mirror  M2  was  driven 
by  PZT  (Burleigh  PZ-80)  over  a  6pm  range.  It  should  be  noted  the  response  of  PZT  to  a 
linear  ramp  exhibited  5%  non-linearity  due  to  its  inherent  hysteresis^.  So  that,  after 
sampling,  A/D  conversion  and  calculating  displacement,  the  polynomial  fit  function  is  used 
to  give  the  mean  square  error  (mse).  This  system  was  also  used  for  measurements  of  the 
noise  of  electrical  circuit  for  a  fixed  measuring  point,  and  to  the  characterize  the 
performance  of  the  feedback  control  system. 

Second  for  wide  dynamic  range  measurements,  the  object  mirror  M2  was 
positioned  by  a  Picomotor  (New  Focus  Inc.)  micrometer-replacement  actuator  that  is 
specified  at  <0.  lnm  resolution  with  remote  control171.  In  our  system  the  Picomotor  was 
operated  under  computer  control  and  movejd_^£otal  displacement  of  more  than  1mm,  in 
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steps  not  less  than  10  nm.  At  each  step,  the  output  voltages  of  the  phase  demodulators 
were  sampled  and  converted  to  displacement  using  a  Lab  VIEW  Program  that  we  wrote 
for  these  measurements.  Then  displacements  are  shown  on  computer. 


6.  Experimental  Results 

The  experimental  setup  is  shown  in  Fig.  2.  The  system  contains  two  unbalanced 
Michelson  interferometers:  One  (SYS1)  is  used  for  displacement  measurement  and  the 
other  (SYS2)  is  used  for  the  feedback  control  to  reduce  the  effects  of  external 
disturbances.  The  optical  path  difference  between  the  two  arms  in  SYS2  is  2d,.  =  60mm, 
but,  in  SYS1  2di  are  set  to  different  values  for  the  two  different  systems.  The  PZT 
measurements  were  performed  with  2di  =  60mm  and  the  Picomotor  measurements  were 
performed  with  2di  =  1 10mm,  that  is,  the  initial  value  of  z  in  SYS  1  is  0.6  or  1 . 1 .  A 
GaAIAs  laser  diode  (SHARP  LT024MD0)  was  used.  Its’  maximum  output  power  is 
about  5  mW  and  the  central  wavelength  Xq  is  the  785mm.  The  modulation  efficiency  3  of 

the  LD  was  6.0  x  10  The  frequency  of  the  phase  modulation  —  was  set  to  10 

kHz.  The  cutoff  frequency  of  the  LPF  is  set  to  1  kHz  to  essentially  eliminate  the 
modulation  frequency  and  it’s  harmonics. 

As  stated  above,  the  final  conversion  of  the  output  voltages  from  the  demodulator 
to  an  actual  displacement  measurement  and  data  analysis  are  done  in  software  by 
computer.  The  object  mirror  M2  is  mounted  on  PZT  (model  PZ-80,  Burleigh)  or 
translation  stage  moved  by  a  Picomotor  driver  (New  Focus,  model  8801). 

Now  we  show  some  experimental  results  using  the  setup  shown  in  Fig.  2. 

First,  we  checked  the  influence  of  the  electronic  noise  on  the  measurement  system 
by  running  a  series  of  tests  with  no  input  signal  to  the  harmonic  demodulator.  Fig.  3 
shows  a  plot  of  the  electronic  noise  influence  over  a  long  period  of  time.  As  can  be 
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Figure  3.  Measured  displacement  error  due  to  electronic  noise. 


seen  in  Fig.  3.  the  electronics  noise  contributes  a  measurement  error  standard  deviation  of 
0.13  nm.  But,  the  minimum  detectable  phase  (MDP)  in  our  system  can  be  determined 

"In 

primarily  by  one  coded-phase  angle  — jy  rad.  Comparing  this  to  the  maximum  spatial 


resolution  should  be 


ivauiukiuit  jiivuiu  l/v  n  l  _  i  \j  .  x 
2  \2) 

small  uncertainty  to  the  measurement 


0. 1  nm,  it  is  clear  that  the  electronics  noise  only  adds  a 


Next,  measurements  at  a  single  point  were  done  with  and  without  feedback,  with  and 
without  acoustic  noise,  with  and  without  air  flow,  etc.  Figs.  4  and  5  show  two  of  these 
experiments.  Comparing  with  Figs.  4  and  Fig.  5,  we  can  see  that  the  dc  drift  in 
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Figure  4.  Single  Point  Measurement  with  Acoustic  Noise  and  no  Feedback 


Fig.  4  is  almost  completely  suppressed  when  feedback  control  was  introduced,  and,  the 
standard  deviation  was  reduced  to  ~  1.3nm. 


Figure  5.  Single  Point  Measurement  with  Acoustic  Noise  and  Feedback. 
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Third,  Fig.  6  shows  that  a  large  displacement,  more  than  4  times  can  be  measured 

with  high  accuracy.  In  fact,  it  follows  the  nonlinear  characteristic  of  the  PZT  with  an 
accuracy  of 


Coefficients 

iO  1.803  58.866  -1.477  0.X7  -0.021  0.001 


Figure  6.  PZT  movement  over  250  volts. 

better  than  5nm  as  shown  in  Fig.  6,  where  we  used  the  polynomial  fit  function  from  the 
program  to  correct  for  the  non-linearity  of  the  PZT.  The  accuracy  of  our  system  was 
determined  by  taking  repeated  sets  of  data  and  computing  the  square  root  of  the  mean 
square  error  (mse)  given  by  the  polynomial  fit  function  from  the  program. 

Finally,  we  used  a  Picomotor  instead  of  PZT  to  investigate  the  characteristic  of  the 
experiment  system  at  larger  range.  With  this  system  we  demonstrated  high  accuracy  over 
0.2  mm  of  dynamic  range. 

In  these  experiments,  we  delayed  the  measurement  by  one  second  after  each  step 
in  order  to  allow  the  mirror  hardware  to  damp  out  vibrations  after  each  step.  The  results 
of  a  linear  fit  function  with  the  slope  and  mean  square  error  calculated.  The  total 
displacement  was  19130  nm  or  0.191  mm,  the  standard  deviation  of  each  point  was 
44.7nm.  This  large  standard  deviation  is  a  result  of  the  fact  that  the  Picomotor 
periodically  does  not  move  at  all,  when  a  signal  is  applied.  This  performance  is  within  the 
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manufacturers  specifications.  The  measurements  also,  clearly  shows  the  periodic  non¬ 
linearity’s  in  the  motion  of  the  Picomotor  when  the  screw  is  rotated  by  2n. 

For  calibration  of  the  wide  dynamic  range  measurements  a  HEIDENHAIN 
(Control  Technics  Cop.)  was  used  also  used  to  measure  the  distance  traveled  and  the 
results  compared  to  our  WDR-SPM  systems  calculations,  the  results  agreed  within  the 
accuracy  of  the  HEIDENHAIN  gauge  within  gauge’s  0. 1  um  resolution.  Long  term 
temperature  drifts  caused  the  HEIDENHAIN  gauge  reading  to  vary  by  ±  0.4  pm,  for  large 
distances. 

We  also  measured  the  limits  of  the  dynamic  range  of  our  experimental  system  and 
found  it  to  be  1mm  with  the  hardware  that  we  have.  However,  the  tail,  which  consists  of 
the  largest  distance  measurements  point  clearly  deviates  from  a  linear  fit.  This  is  due  to  the 
limitations  of  the  picomotor,  and  can  easily  be  extended  using  a  precision  stepper  motor 
system. 


7.  Discussion 

7.1  Permitted  Step  Displacement  Using  Picomotor  Actuator 

Since  the  Picomotor  actuator  is  a  mechanical  positioner  and  the  mechanical  stepping 
causes  some  vibration,  we  delayed  the  beginning  of  the  A/D  board  sampling  by  1  second 
so  that  the  mechanical  vibrations  were  damped  out.  This  delay  time  can  be  easily  changed 
in  Lab  VIEW  program.  Once  the  delay  time  has  been  selected  a  0.4  volt  pulse  is  sent  to 
the  Picomotor  this  corresponded  to  a  minimum  step  displacement  of  about  15  nm  and  an 
average  displacement  of  about  20  nm.  In  our  program,  we  used  a  20  nm  step  size  for  our 
linear  fit. 


7.2  Limitations  of  dynamic  range 

As  mentioned  previously,  the  phase  demodulator  was  designed  to  take  two  signal  outputs 
which  were  proportional  to  the  instantaneous  sine  and  cosine  of  the  phase  angle  of  the 
difference  optical  path  between  two  arms  in  SYS  1.  When  mirror  M2  was  moved,  the 
amplitudes  of  sine  or  cosine  scaled  proportionally  to  Jx  (z)  or  J2  (z),  respectively.  This 
method  may  be  used  for  Ji  (z)  and  J2  (z)  as  long  as,  z,  is  between  0.5  and  3.5.  Assuming 
that  a  =  0. 1  ma,  P  =  6  x  10'3  nm/mA  and  Ao  =  785  nm,  it  is  possible  to  obtain  a  dynamic 
range  of  a  few  tens  of  centimeters. 

Because  the  final  conversion  of  the  output  voltages  from  the  demodulators  to  an 
actual  displacement  is  done  in  software  by  computer,  and  there  always  is  a  smallest 
detectable  change  in  voltage  on  a  data  acquisition  board  (the  change  in  voltage  represents 
a  least  significant  bit  (LSB)  of  the  digital  value).  For  example,  at  our  system,  the  LSB  is 
required  a  1.22  mV.  If  either  output  signal^ei|jier  sin  or  cosine)  from  the  demodulator  is 
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reduced  less  than  1.22  mV,  no  change  will  be  detected.  We  also  observed  that  the  signal 
of  the  first  harmonic  component ,  falls  off  by  a  factor  of  3  relative  to  it’s  initial  value, 
then  the  signal  was  then  below  the  sensitivity  of  the  A/D  ,  thus  the  software  didn’t 
correctly  calculate  displacement. 

Therefore,  the  maximum  range  is  determined  by  the  photo-current  on  a  photodetector, 
gain  of  demodulator,  and  LSB  on  a  data  acquisition  board.  In  this  case  we  were  limited  by 
the  demodulator  and  if  we  increased  the  gain  of  the  demodulator  the  maximum  dynamic 
range  will  be  increased  for  our  experimental  system. 


8.  Conclusion 


We  developed  the  first  wide  dynamic  range  SPM-LD  interferometer  that  uses  a  slowly- 
varying  approximation  and  built  a  real-time  displacement  measurement  system  with  a  local 

271 

resolution  of  — jj-  under  normal  laboratory  conditions.  Based  on  this  development  of 
2 


theory,  it  is  not  necessary  to  keep  the  amplitudes  of  the  first,  the  second  harmonic 
component  equal  in  experiment,  so  it  is  easy  to  implement  displacement  measurement  with 
1mm  range.  Due  to  use  feedback  control  system  to  reduce  external  disturbances,  this 
method  is  insensitive  to  the  wavelength  fluctuation  of  LD,  the  gain  drift,  and  the 
temperature  change  or  vibration  of  external  circumstance.  The  standard  deviation  of  5nm 
for  PZT,  the  dynamic  range  of  1  mm  for  Picomotor  actuator  has  been  demonstrated  with 
the  same  experimental  system.  The  use  of  the  Lab  VIEW  software  for  data  acquisition  and 
instrument  control,  made  it  relatively  easy  to  implement  a  real-time  displacement 
measurement.  Our  calibration  for  Picomotor  is  more  directly  and  accuracy  than  Ref.  [7], 
If  LPF’s  gain  can  be  improved  then  the  dynamic  range  can  be  increased  to  10. 

This  is  the  first  wide  dynamic  range  high  resolution  distance  measuring  system  that 
can  tolerate  vibrations  of  several  hundred  nm.  The  system  is  suitable  for  real  time  closed 
loop  operation  in  a  high  precision  machining  environment. 
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CLEO  ’97.  It  is  in  preparation  and  will  be  submitted  upon  presentation. 
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